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MODIFIED FACTOR VIII 



CROSS REFERENCE TO RELATED APPLICATIONS 
This application claims priority from United States 
0 Patent Application No. 09/037,601 FILED March 10, 1998. 

ACKNOWLEDGEMENT OF FEDERAL RESEARCH SUPPORT 
The government has rights in this invention arising from 
National Institutes of Health Grant Nos . HL40921, HL46215, and 
L5 HL3 6 0 94 that partially funded the research leading to this 

invention. 

BACKGROUND OF THE INVENTION 
This invention relates generally to a hybrid factor VIII 
20 having human and animal factor VIII amino acid sequence or 

having human factor VIII and non-factor VIII amino acid 
sequence and methods of preparation and use thereof. 

Blood clotting begins when platelets adhere to the cut 
25 wall of an injured blood vessel at a lesion site. 

Subsequently, in a cascade of enzymatically regulated 
reactions, soluble fibrinogen molecules are converted by the 
enzyme thrombin to insoluble strands of fibrin that hold the 
platelets together in a thrombus. At each step in the 
30 cascade, a protein precursor is converted to a protease that 

cleaves the next protein precursor in the series. Cof actors 
are required at most of the steps. 

Factor VIII circulates as an inactive precursor in blood, 
3 5 bound tightly and non-covalently to von Willebrand factor. 

Factor VIII is proteolytically activated by thrombin or factor 



BNSDOCID: <WO 994627uAl_L> 



WO 99/46274 



PCT/US99/05193 



Xa, which dissociates it from von Willebrand factor and 
activates its procoagulant function in the cascade. In its 
active form, the protein factor 1 ' "Villa is a cof actor that 
increases the catalytic efficiency of factor IXa toward factor 
X activation by several orders of magnitude. 

People with deficiencies in factor VIII or antibodies 
against factor VIII who are not treated with factor VIII 
suffer uncontrolled internal bleeding that may cause a range 
of serious symptoms, from inflammatory reactions in joints to 
early death. Severe hemophiliacs, who number about 10,000 in 
the United States, can be treated with infusion of human 
factor VIII, which will restore the blood's normal clotting 
ability if administered with sufficient frequency and 
concentration. The classic definition of factor VIII, in 
fact, is that substance present in normal blood plasma that 
corrects the clotting defect in plasma derived from 
individuals with hemophilia A. 

The development of antibodies ("inhibitors" or 
"inhibitory antibodies") that inhibit the activity of factor 
VIII is a serious complication in the management of patients 
with hemophilia. Autoantibodies develop in approximately 20% 
of patients with hemophilia A in response to therapeutic 
infusions of factor VIII. In previously untreated patients 
with hemophilia A who develop inhibitors, the inhibitor 
usually develops within one year of treatment . Additionally, 
autoantibodies that inactivate factor VIII occasionally 
develop in individuals with previously normal factor VIII 
levels. If the inhibitor titer is low enough, patients can be 
managed by increasing the dose of factor VIII. However, often 
the inhibitor titer is so high that it cannot be overwhelmed 
by factor VIII. An alternative strategy is to bypass the need 
for factor VIII during normal hemo stasis us±ng^ factor IX 
complex preparations (for example, KONYNE®, Proplex®) .or 
recombinant human factor Villa. Additionally, since porcine 
factor VIII usually has substantially less reactivity with 
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inhibitors than human factor VIII, a partially purified 
porcine factor VIII preparation. (HYATE : C®) is used. Many- 
patients who have developed inhibitory antibodies to human 
factor VIII have been successfully treated with porcine factor 
VIII and have tolerated such treatment for long periods of 
time. However, administration of porcine factor VIII is not 
a complete solution because inhibitors may develop to porcine 
factor VIII after one or more infusions. 

Several preparations of human plasma-derived factor VIII 
of varying degrees of purity are available commercially for 
the treatment of hemophilia A. These include a partially- 
purified factor VIII derived from the pooled blood of many 
donors that is heat- and detergent- treated for viruses but 
contain a significant level of antigenic proteins; a 
monoclonal antibody-purified factor VIII that has lower levels 
of antigenic impurities and viral contamination; and 
recombinant human factor VIII, clinical trials for which are 
underway. Unfortunately, human factor VIII is unstable at 
physiologic concentrations and pH, is present in blood at an 
extremely low concentration (0.2 /xg/ml plasma), and has , ..low 
specific clotting activity. 

Hemophiliacs require daily replacement of factor VIII to 
prevent bleeding and the resulting deforming hemophilic 
arthropathy. However, supplies have been inadequate and 
problems in therapeutic use occur due to difficulty in 
isolation and purification, immunogenic ity, and the necessity 
of removing the AIDS and hepatitis infectivity risk. The use 
of recombinant human factor VIII or partially-purified porcine 
factor VIII will not resolve all the problems. 

The problems associated with the commonly used, 
commercially available, plasma -derived factor VIII have 
stimulated significant interest in the development of a better 
factor VIII product. There is a need for a more potent factor 
VIII molecule so that more units of clotting activity can be 
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delivered per molecule; a factor VIII molecule that is stable 
at a selected pH. and. physiologic concentration; a factor VIII 
molecule that is less apt to cause production of inhibitory 
antibodies; and a factor VIII molecule that evades immune 
detection in patients who have already acquired antibodies to 
human factor VIII. 

It is therefore an object of the present invention to 
provide a factor VIII that corrects hemophilia in a patient 
deficient in factor VIII or having inhibitors to factor VIII. 

It is a further object of the present invention to 
provide methods for treatment of hemophiliacs. 

It is still another object of the present invention to 
provide a factor VIII that is stable at a selected pH and 
physiologic concentration. 

It is yet another object of the present invention to 
provide a factor VIII that has greater coagulant activity than 
human factor VIII. 

It is an additional object of the present invention to 
provide a factor VIII against which less antibody is produced. 

. -.. . .SUMMARY OF THE INVENTION 
The present invention provides isolated, purified, hybrid 
factor VIII molecules and fragments thereof with coagulant 
activity including hybrid factor VIII having factor VIII amino 
acid sequence derived from human and pig or other non-human 
mammal (together referred to herein as "animal"); or in a 
second embodiment including a hybrid equivalent factor VIII 
having factor VIII amino acid sequence derived from human or 
animal or both and amino acid sequence havingp no known 
sequence identity to factor VIII* ( "non-factor VIII amino acid 
sequence") , preferably substituted in an antigenic and/or 
immunogenic region of the factor VIII, is described. One 
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skilled in the art will realize that numerous hybrid factor 
VIII constructs :can be prepared including, but not limited to, 
human/animal factor VIII having greater coagulant activity 
than human factor VIII ("superior coagulant activity"); non- 
immunogenic human/equivalent factor VIII; non-antigenic 
human/equivalent or human/animal factor VIII ;- non -immunogenic 
human/animal or human/equivalent factor VIII having superior 
coagulant activity; non-antigenic human/animal or 
human/animal/equivalent factor VIII having superior coagulant 
activity; non- immunogenic , non-antigenic human/equivalent or 
human/equivalent/animal factor VIII; and non- immunogenic , non- 
antigenic human/animal/equivalent factor VIII having superior 
coagulant activity. 

The hybrid factor VIII molecule is produced by isolation 
and recombination of human and animal factor VIII subunits or 
domains; or by genetic engineering of the human and animal 
factor VIII genes. 

In a preferred embodiment, recombinant DNA methods are 
used to substitute elements of animal factor VIII for. the 
corresponding elements of human factor VIII, resulting in 
hybrid human/animal factor VIII molecules. In a second 
preferred embodiment, recombinant DNA methods are used to 
replace one or more amino acids in the human or animal factor 
VIII or in a hybrid human/animal factor VIII with amino acids 
that have no known sequence identity to factor VIII, 
preferably a sequence of amino acids that has less 
immunoreactivity with naturally occurring inhibitory 
antibodies to factor VIII ( "nonantigenic amino acid sequence") 
and/or is less apt to elicit the production of antibodies to 
factor VIII ( "non -immunogenic amino acid sequence") than human 
factor VIII. An example of an amino acid sequence that can be 
used to replace immunogenic or antigenic sequence is- a 
sequence of alanine residues. 
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In another embodiment, subunits of factor VIII are 
isolated and purified from human or; -.animal plasma, and hybrid 
human/animal factor VIII is produced either by mixture of 
animal heavy chain subunits with human light chain subunits or 
by mixture of human heavy chain subunits with animal light 
chain subunits, thereby producing human light chain/animal 
heavy chain and human heavy chain/animal light chain hybrid 
molecules. These hybrid molecules ar t e isolated by ion 
exchange chromatography. 

Alternatively, one or more domains or partial domains of 
factor VIII are isolated and purified from human or animal 
plasma, and hybrid human/animal factor VIII is produced by 
mixture of domains or partial domains from one species with 
domains or partial domains of the second species. Hybrid 
molecules can be isolated by ion exchange chromatography. 

Methods for preparing highly purified hybrid factor VIII 
are described having the steps of: (a) isolation of subunits 
of plasma-derived human factor VIII and subunits . of plasma- 
derived animal factor VIII, followed by reconstitution of 
coagulant activity by mixture of .human' and animal subunits, 
followed by isolation of hybrid human/animal factor VIII by 
ion exchange chromatography; (b) isolation of domains or 
partial domains of plasma-derived human factor VIII and 
domains or partial domains of plasma-derived animal factor 
VIII, followed by reconstitution of coagulant activity by 
mixture of human and animal domains, followed by isolation of 
hybrid human/animal factor VIII by ion exchange 
chromatography; (c) construction of domains or partial domains 
of animal factor VIII by recombinant DNA technology, and 
recombinant exchange of domains of animal and- human factor 
VIII to produce hybrid human/animal factor VIII with coagulant 
activity; (d) creation of hybrid human/animal factor VIII by 
replacement of specific amino acid residues of the factor VIII 
of one species with the corresponding unique amino acid 
residues of the factor VIII of the other species; or (e) 
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creation of a hybrid equivalent factor VIII molecule having 
human or animal amino acid sequence or both, in which specific 
amino acid residues of the factor VIII are replaced with amino 
acid residues having no known sequence identity to factor VIII 
5 by site-directed mutagenesis. 

The determination of the entire DNA sequence encoding 
porcine factor VIII set forth herein has enabled, for the 
first time, the synthesis of full-length porcine factor VIII 

10 by expressing the DNA encoding porcine factor VIII in a 

suitable host cell. Purified recombinant porcine factor VIII 
is therefore an aspect of the present invention. The DNA 
encoding each domain of pcrcine factor VIII as well as any 
specified fragment thereof, can be similarly expressed, either 

15 by itself or in combination with DNA encoding human factor 

VIII to make the hybrid human/porcine factor VIII described 
herein. Furthermore, porcine fVIII having all or part of the 
B domain deleted (B-doraainless porcine fVIII) is made 
*~ v available as part of the present invention, by expression DNA 

20 ' - encoding porcine fVIII having a deletion of one or more codons 
— r of the B -domain. . 

Some embodiments of hybrid or hybrid equivalent factor 
VIII have specific activity greater than that of human factor 
25 VIII and equal to or greater than that of porcine factor VIII- 

Some embodiments of hybrid or hybrid equivalent factor VIII 
have equal or less immunoreactivity with inhibitory antibodies 
to factor VIII and/or less imraunogenicity in humans or 
animals, compared to human or porcine factor VIII. 

30 

Also provided are pharmaceutical compositions and methods 
for treating patients having factor VIII deficiency comprising 
administering the hybrid or hybrid equivalent factor VIII. 

35 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figures 1A-1H taken together provide an aligned sequence 
comparison of the human,' pig and mouse factor VIII acid 
sequences . 

DETAILED DESCRIPTION OF THE INVENTION 
Unless otherwise specified or indicated, as used herein, 
"factor VIII" denotes any functional factor VIII protein 
molecule from any animal, any hybrid factor VIII or modified 
factor VIII, "hybrid factor VIII" or "hybrid protein" denotes 
any functional factor VIII protein molecule or fragment 
thereof comprising factor VIII amino acid sequence from human, 
porcine, and/or non-human, non-porcine mammalian species. 
Such combinations include, but are not limited to, any or all 
of the following hybrid factor- VIII molecules or fragments 
thereof: (1) human/porcine; (2) human/non- human, non-porcine 
mammalian, such as human/mouse; (3) porcine /non- human, non- 
porcine mammalian, such as mouse/dog. Such combinations also 
include hybrid factor VIII equivalent molecules or fragments 
thereof, as further defined below, comprising factor "VIII 
amino acid sequence of hybrid, human, porcine, or non -human, 
non-porcine mammalian origin in which amino acid sequence 
having no known sequence identity to factor VIII is 
substituted. Such hybrid combinations also include hybrid 
factor VIII amino sequence derived from more than two species, 
such - as human/pig/mouse, or from two or more species in which 
amino acid sequence having no known sequence identity to 
factor VIII is substituted. Unless otherwise indicated, 
"hybrid factor VIII" includes fragments of the hybrid factor 
VIII, which can be used, as described below in one exemplary 
embodiment, as probes for research purposes or as diagnostic 
reagents . 

As used herein, "mammalian factor VIII" includes factor 
VIII with amino acid sequence derived from any non-human 
mammal, unless otherwise specified. "Animal", as used herein, 
refers to pig and other non-human mammals. 

8 
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A "fusion protein" or "fusion factor VIII or fragment 
thereof", as used herein, is the product of a hybrid gene in 
which the coding sequence for one protein is extensively- 
altered, for example, by fusing part of it to the coding 
sequence for a second protein from a different gene to produce 
a hybrid gene that encodes the fusion protein: As used 
herein, a fusion protein is a subset of the hybrid factor VIII 
protein described in this application. 

A "corresponding" nucleic acid or amino acid or sequence 
of either, as used herein, is one present at a site in a 
factor VIII or hybrid factor VIII molecule or fragment thereof 
that has the same structure and/or function as a site in the 
factor VIII molecule of another species, although the nucleic 
acid or amino acid number may not be identical. A sequence 
"corresponding to" another factor VIII sequence substantially 
corresponds to such sequence, and hybridizes to the sequence 
of the designated SEQ ID NO. under stringent conditions. A 
se q uence "corresponding to" another factor VIII sequence also 
includes a sequence that results in the expression of a factor 
VIII or claimed procoagulant hybrid factor VIII or fragment 
thereof and would hybridize to the designated SEQ ID NO. but 
for the redundancy of the genetic code. 

A "unique" amino acid residue or sequence, as used 
herein, refers to an amino acid sequence or residue in the 
factor VIII molecule of one species that is different from the 
homologous residue or sequence in the factor VIII molecule of 
another species . 

"Specific activity," as used herein, refers to the 
activity that will correct the coagulation defect of human 
factor VIII deficient plasma. Specific activity is measured 
in units' of clotting activity per milligram total factor VIII 
protein in a standard assay in which the clotting time of 
human factor VIII deficient plasma is compared to that of 
normal human plasma. One unit of factor VIII activity is the 
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activity present in one milliliter of normal human plasma. In 
the assay,, the shorter the time : ;for clot formation, the 
greater the activity of the. factor VIII being assayed. Hybrid 
human/porcine factor VIII has coagulation activity in a human 
factor VIII assay. This activity, as well as that of other 
hybrid or hybrid equivalent factor VIII molecules or fragments 
thereof, may be less than, equal to, or greater than that of 
either plasma-derived or recombinant human factor VIII. 

The human factor VIII cDNA nucleotide and predicted amino 
acid sequences are shown in SEQ ID NOs : 1 and 2, respectively. 
Factor VIII is synthesized as an approximately ,300 kDa single 
chain protein with internal sequence homology that defines the 
"domain" sequence 3MH 2 -Al-A2-B-A3-Cl-C2-COOH. In a factor VIII 
molecule, a "domain", as used herein, is a continuous sequence 
of amino acids that is defined by internal amino acid sequence 
identity and sites of proteolytic cleavage by, thrombin. 
Unless otherwise specified, factor VIII domains include the 
following amino acid residues, when the sequences are aligned 
with the human amino acid sequence (SEQ ID N0:2) : Al, 
residues Alal-Arg372; A2 , residues Ser373-Arg740 ; B, residues 
Ser741-Argl648 ; A3, residues Serl690 - Ile2032 ; CI, residues 
Arg2033-Asn2172 ; C2 , residues Ser2173 -Tyr2332 . The A3-C1-C2 
sequence includes residues Serl690-Tyr2332 . The remaining 
sequence, residues Glul649-Argl689 , is usually referred to as 
the factor VIII light chain activation peptide. Factor VIII 
as ? proteolytically activated by thrombin or factor Xa, which 
dissociates it from von Willebrand factor, forming factor 
Villa, which has procoagulant function. The biological 
function of factor Villa is to increase the catalytic 
efficiency of factor IXa toward factor X activation by several 
orders of .magnitude . Thrombin-activated factor Villa is a 160 
kDa A1/A2/A3 -C1-C2 heterotrimer - that forms a complex with 
factor . IXa and factor X on the surface of platelets^ or 
monocytes. A "partial domain" as used herein is a continuous 
sequence of amino acids forming part of a domain. 
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"Subunits" of* human or animal factor VIII, as used 
herein, are the heavy and light chains of the protein. The 
heavy chain of factor VIII contains three domains, Al , A2 , and 
B. The light chain of factor VIII also contains three 
domains, A3, CI, and C2 . 

The hybrid factor VIII or fragment thereof can be made 
(1) by substitution of isolated, plasma-derived animal 
subunits or human subunits (heavy or light chains) for 
corresponding human subunits or animal subunits; (2) by 
substitution of human domains or animal domains (Al , A2 , A3, 
B, CI, and C2) for corresponding animal domains or human 
domains; (3) by substitution of parts of human domains or 
animal domains for parts of animal domains or human domains; 
(4) by substitution of at least one specific sequence 
including one or more unique human or animal amino acid(s) for 
the corresponding animal or human amino acid(s); or (5) by 
"substitution of amino acid sequence that has no known sequence 
identity to factor VIII for at least one sequence including 
one or more specific amino acid residue (s) in human, animal, 
■•6r hybrid factor VIII or fragments thereof. A "B-domainless" 
-hybrid factor VIII, hybrid equivalent factor VIII, or fragment 
of either, as used herein, refers to any one of the hybrid 
factor VIII constructs described herein that lacks the B 
domain. 

The terms "epitope" , "antigenic site", and "antigenic 
determinant", as used herein, are used synonymously and are 
defined as a portion of the human, animal, hybrid, or hybrid 
equivalent factor VIII or fragment thereof that is 
specifically recognized by an antibody. It can consist of any 
number of amino acid residues, and it can be dependent upon 
the primary, secondary, or tertiary structure of the protein. 
In accordance with this disclosure, a hybrid factor VIII, 
hybrid factor VIII equivalent, or fragment of either that 
includes at least one epitope may be used as a reagent in the 
diagnostic assays described below. In some embodiments, the 
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hybrid or .hybrid equivalent factor VIII or fragment thereof is 
not cross-reactive or is less cross-reactive with all 
naturally occurring inhibitory factor VIII antibodies than 
human or porcine factor VIII. 

-The term "immunogenic site", as used herein, is defined 
as a region of the human cr animal factor VIII, hybrid or 
hybrid equivalent factor VIII, or fragment thereof that 
specifically elicits the production of antibody to the factor 
VIII, hybrid, hybrid equivalent, or fragment in a human or 
animal, as measured by routine protocols, such as immunoassay, 
e.g. ELISA, or the Bethesda assay, described herein. It can 
consist of any number of amino acid residues, and it can be 
dependent upon the primary, secondary,, or tertiary structure 
of the protein. In some embodiments, the hybrid or hybrid 
equivalent factor VIII or fragment thereof is nonimmunogenic 
or less immunogenic In an animal or human than human or 
porcine factor VIII. 

. As used herein, a "hybrid factor VIII equivalent molecule 
or fragment thereof" or "hybrid equivalent factor VIII or 
.fragment thereof" is an active factor VIII or hybrid factor 
VIII molecule or fragment thereof comprising at least one 
sequence including one or more amino acid residues that have 
no known identity to human or animal factor VIII sequence 
substituted for at least one sequence including one or more 
specific .amino acid residues in the human, animal, or hybrid 
factor VIII or fragment thereof. The sequence of one or more 
amino acid residues that , have no known identity to human or 
animal factor VIII sequence is also referred to herein as 
."non-factor VIII amino acid sequence" . In a preferred 
embodiment, the amino acid(s) having no known sequence 
identity to .factor. VIII sequence are alanine residues. In 
another preferred, embodiment, the specific factor VIII 
sequence for which the amino acid(s) having no known sequence 
identity to. factor VIII sequence are substituted . includes an 
antigenic site, that is immunoreac.tive with naturally occurring 
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factor VIII inhibitory antibodies, s\ich that the resulting 
hybrid factor VIII equivalent molecule or fragment thereof is 
less immunoreactive or not immunoreactive with factor VIII 
inhibitory antibodies. In yet another preferred embodiment, 
the specific hybrid factor VIII sequence for which the amino 
acid(s) having no known sequence identity to factor VIII 
sequence are substituted includes an immunogenic site that 
elicits the formation of factor VIII inhibitory antibodies in 
an animal or human, such that the resulting hybrid factor VIII 
equivalent molecule or fragment thereof is less immunogenic. 

"Factor VIII deficiency, " as used herein, includes 
deficiency in clotting activity caused by production of 
defective factor VIII, by inadequate or no production of 
factor VIII, or by partial or total inhibition of factor VIII 
by inhibitors. Hemophilia A is a type of factor VIII 
deficiency resulting from a defect in an X-linJced gene and the 
absence or deficiency of the factor VZII protein it encodes. 

As used herein, "diagnostic assays" include assays that 
in some manner utilize the antigen-axitibody interaction to 
detect and/or quantify the amount of a particular antibody 
that is present in a test sample to assist in the selection of 
medical therapies . There are many such assays known to those 
of skill in the art. As used herein, however, the hybrid or 
hybrid equivalent factor VIII DNA or fragment thereof and 
protein expressed therefrom, in whole or in part, can be 
substituted for the corresponding reagents in the otherwise 
known assays, whereby the modified assays may be used to 
detect and/or quantify antibodies to factor .VIII. It is the 
use of these reagents, the hybrid or hybrid equivalent factor 
VIII DNA or fragment thereof or protein expressed therefrom, 
that permits modification of known assays for detection of 
antibodies to human or animal factor VIII or to hybrid 
human/animal factor VIII. Such assays include, but are not 
limited to ELISAs, immunodiffusion assays, and immunoblots. 
Suitable methods for practicing any of these assays are known 
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to those of skill in the art . As used herein, the hybrid or 
hybrid equivalent factor • VIII- or fragment thereof that 
- ■ includes at least one epitope of the protein can be used as 
the diagnostic reagent. Examples of other assays in which the 
5; hybrid or hybrid equivalent factor VIII or fragment thereof 

can be used include the Bethesda assay and anticoagulation 
assays. 

GENERAL DESCRIPTION OF METHODS 
10 U.S. Serial No. 07/864,004 described the discovery of 

hybrid human/porcine factor VIII molecules having coagulant 
activity, in which elements of the factor VIII molecule of 
human or pig are substituted for corresponding elements of the 
factor VIII molecule of the other species. U.S. Serial No. 
15 08/212,133 and PCT/US94/13200 describe procoagulant hybrid 

human/animal and hybrid equivalent factor VIII molecules, in 
which elements of the factor VIII molecule of one species are 
substituted for corresponding elements of the factor VIII 
molecule of the other species. 

20 

■ The present, invention . provides * hybrid human/animal, 
animal/animal, and equivalent factor VIII. molecules and 
fragments thereof, and the nucleic acid sequences encoding 
such hybrids, some of which have greater coagulant activity in 

25 a standard clotting assay when compared , to highly-purified 

human factor VIII; and/or are . less immunoreactive to 
inhibitory antibodies to, human or porcine factor VIII than 
human or porcine factor VIII; and/or are less immunogenic in 
a human or animal than human or porcine factor VIII. These 

30 hybrid factor VIII molecules can be constructed as follows.. 

At least five types- of - active hybrid human/porcine or 
hybrid equivalent factor VIII molecules or fragments thereof, 
the nucleic acid sequences encoding these hybrid factor VIII 
35 molecules, and the methods for preparing them are disclosed 

-herein: those obtained :{1) by substituting a human or porcine 
subunit .(i.e., heavy chain ■ or- light chain); for the 
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corresponding porcine or human subunit; (2) by substituting 
one or more human or porcine domain(s) (i.e., Al, A2 , A3, B, 
CI, and C2) for the corresponding porcine or human domain (s) ; 
(3) by substituting a continuous part of one or more human or 
5 porcine domain (s) for the corresponding part of one or more 

porcine or human domain(s); (4) by substituting at least one 
specific sequence including one or more unique amino acid 
residue (s) in human or porcine factor VIII for the 
corresponding porcine or human sequence; and (5) by 
10 substituting at least one sequence including one or more amino 

acid residue (s) having no known sequence identity to factor 
VIII ("non-factor VIII amino acid sequence") for at least one 
specific sequence of one or more amino acids in human, 
porcine, or hybrid human/porcine factor VIII. 

15 

At least five types of active hybrid human/non- human, 
non-porcine mammalian or hybrid equivalent factor VIII 
molecules or fragments thereof, and the nucleic acid sequences 
encoding them, can also be prepared by the same methods: 

20 those obtained (!) by substituting a human or non-human, non- 

porcine mammalian subunit (i.e., heavy chain or light chain) 
for the corresponding non-human, non-porcine mammalian or 
human subunit; (2) by substituting one or more human or non- 
human, non-porcine mammalian domain(s) (i.e., Al , A2 , A3, B, 

25 CI and C2 ) for the corresponding non-human, non-porcine 

mammalian or human domain (s) ; (3) by substituting a continuous 
part of one or more human or non- human, non-porcine mammalian 
domain (s) for the corresponding part of one or more non-human, 
non-porcine mammalian or human domain (s) ; (4) by substituting 

3 0 at least one specific sequence including one or more unique 

amino acid residue (s) in human or non-human, non-porcine 
mammalian factor VIII for the corresponding non-human, non- 
porcine mammalian or human sequence; and (5) by substituting 
at least one sequence including one or more amino aeid 

35 residue (s) having no known sequence identity to factor VIII 

("non-factor VIII amino acid sequence 11 ) for at least one 
specific sequence of one or more amino acids in human, non- 
15 
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human, non-porcine mammalian, or hybrid human/non- human, non- 
porcine mammalian* f actor ..VIII . 

Further, one skilled in the art will readily recognize 
that the same methods can be used to prepare at least five 
types of active hybrid -factor VIII molecules or fragments 
thereof, corresponding to types (1) - (5) in the previous two 
paragraphs-, comprising factor VIII amino acid sequence from 
two or more non-human mammals, such as porcine /mouse , and 
further comprising non-factor VIII amino acid sequence. 

Hybrid human/animal, animal/animal, and equivalent factor 
VIII proteins or fragments thereof listed above under groups 
(l)-(3) are made by isolation of subunits, domains, or 
continuous parts of domains of plasma-derived factor VIII, 
followed by reconstitution and purification. Hybrid 
human/animal, animal/animal, and equivalent factor VIII 
proteins or fragments thereof described under groups (3) -(5) 
above are made by recombinant DNA methods . The hybrid 
molecule may contain a greater or lesser percentage of human 
than animal sequence, depending, on the origin of the various 
regions, as described in more detail below. 

Since current information indicates that the B domain has 
no inhibitory epitope and has no known effect on factor VIII 
function, in some embodiments the B domain . is deleted in the 
active hybrid or hybrid equivalent factor VIII molecules or 
fragments thereof ("B(-) factor VIII 11 ) prepared by any of the 
methods described herein. 

It is shown in Example 4 that hybrid human/porcine factor 
VIII comprising porcine heavy chain and human light chain and 
corresponding to the first type of hybrid listed above has 
greater specific coagulant activity in a standard clotting 
assay compared to human factor VIII. The hybrid human/animal 
or equivalent factor VIII with coagulant activity, whether the 
activity is higher/ equal to, or lower than that of human 
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factor VIII, can be useful in treating patients with 
inhibitors, since these inhibitors can react less with hybrid 
human/animal or equivalent factor VIII than with either human 
or porcine factor VIII. 

Preparation of hybrid factor VIII molecules from isolated 
human and animal factor VIII subunits by reconstitution : 

The present invention provides hybrid human/animal factor 
VIII molecules or fragments thereof, with subunit 
substitutions, the nucleic acid sequences encoding these 
hybrids, methods for preparing and isolating them, and methods 
for characterizing their procoagulant activity. One method, 
modified from procedures reported by Fay, P. J. et al . (1990) 
J". Biol. Chem. 265 : 6197 ; and Lollar, J.S. et al. (1988) J". 
Biol. Chem. 263 : 10451 , involves the isolation of subunits 

(heavy and light chains) of human and animal factor VIII, 
followed by recombination of human heavy chain and animal 
light chain or by recombination of human light chain and 
animal heavy chain. 

« 

Isolation of both human and animal individual subunits 
involves dissociation of the light chain/heavy chain dimer . 
This is accomplished, for example, by chelation of calcium 
with ethylenediaminetetraacetic acid (EDTA) , followed* by 
monoS™ HPLC ( Pharmacia -LKB , Piscataway, NJ) . Hybrid 
human/animal factor VIII molecules are reconstituted from 
isolated subunits in the presence of calcium. Hybrid human 
light chain/animal heavy chain or animal light chain/human 
heavy chain factor VIII is isolated from unreacted heavy 
chains by monoS™ HPLC by procedures for the isolation of 
porcine factor VIII, such as described by Lollar, J.S. et al . 
(1988) Blood 71 : 137-143 . 

These methods, used in one embodiment to prepare active 
hybrid human/porcine factor VIII, described in detail in the 
examples below, result in hybrid human light chain/porcine 
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heavy chain molecules with greater than six times the 
procoagulant activity of human factor VIII. 

Other hybrid human/non- human, non-porcine mammalian 
factor VIII molecules can be prepared, isolated, and 
characterized for activity by the same methods . One skilled 
in the art will readily recognize that these methods can also 
be used to prepare, isolate,, and characterize for activity 
hybrid animal/animal factor VIII, such as porcine/mouse, 
comprising the light or heavy chain or one species is combined 
with the heavy or light chain of the other species . 

Preparation of hybrid factor VIII molecules from isolated 
human and animal factor VIII domains by reconstitution : 

The present invention provides hybrid human/animal factor 
VIII molecules or fragments thereof with domain substitutions, 
the nucleic acid sequences encoding them, methods for 
preparing and isolating them, and methods for characterizing 
their procoagulant activity. One method involves the 

isolation of one or more domains of human and one or more 
domains of animal factor VIII,. followed by recombination of 
human and animal domains to form hybrid human/animal factor 
VIII with coagulant activity, as described by Lollar, P. et 
al. (Nov. 25, 1992) J. Biol. Chem. 267(33) : 23652-23657 , for 

hybrid human/porcine factor VIII . . 

Specifically provided is a hybrid human/porcine factor 
VIII with substitution o^f the porcine A2 domain for the human 
A2 domain, which embodiment illustrates a method by which 
domain- substituted hybrid human/non- human, non-porcine 
mammalian factor VIII can be constructed., Plasma-derived non- 
human, non-porcine mammalian and human A1/A3-C1-C2 dimers are 
isolated by dissociation of the A2 domain from factor Villa. 
This is accomplished, for example, in the presence of NaOH, 
after which the mixture is diluted and the dimer is eluted 
using monoS™ HPLC ( Pharmacia - LKB , Piscataway, NJ) . The A2 
domain is isolated from factor Villa as a minor component in 



18 



WO 99/46274 



PCT/US99/05193 



the monoS™ HPLC . Hybrid human/animal factor VIII molecules 
are reconstituted by mixing equal volumes of the A2 domain of 
one species and the A1/A3-C1-C2 dimer of the other species. 

Hybrid human/animal factor VIII or fragments thereof with 
one or more domain substitutions is isolated from the mixture 
of unreacted dimers and A2 by monoS™ HPLC by procedures for 
the isolation of porcine factor VIII, as described by Lollar, 
J.S. et al . (1988) Blood 72: 137-143 . Routine methods can also 
be used to prepare and isolate the Al , A3, CI, C2 , and B 
domains of the factor VIII of one species, any one or more of 
which can be substituted for the corresponding domain in the 
factor VIII of the other species. One skilled in the art will 
readily recognize that these methods can also be used to 
prepare, isolate, and characterize for activity domain- 
substituted hybrid animal/animal factor VIII, such as 
porcine /mouse . 

These methods, described in detail in the examples below, 
result in hybrid factor VIII molecules with procoagulant 
activity. : 

Preparation of hybrid factor VIII molecules by 

recombinant engineering of the sequences encod ing human, 

animal, and hybrid factor VIII subunits, d &mains, or 

parts of domains: 

Substitution of subunits, domains, conti nuous parts of 
domains : 

The present invention provides active, recombinant hybrid 
human/animal and hybrid equivalent factor VIII molecules and 
fragments thereof with subunit, domain, and amino acid 
sequence substitutions, the nucleic acid sequences encoding 
these hybrids , methods for preparing and isolating them, and 
methods for characterizing their coagulant, immunoreact ive , 
and immunogenic properties. 
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• The human factor VIII gene was- isolated and expressed in 
mammalian cells, as . reported by Toole, J.J. et al . (1984) 
Nature ,212:342-347 (Genetics Institute) ; ; Gitschier, J. et 
al.(1984) Nature 3.12:326-330 (Genentech) ; Wood, W.I. et al . 

(1984) Nature 312:330-337 (Genentech); Vehar, G.A. et al . 

(1984) Nature ^12:337-342 (Genentech); WO 87/04187; WO 
88/08035; WO 88/03558; U.S. Patent No . 4 , 757 , 006 , and the 
amino acid sequence was deduced from cDNA . U.S. Patent No. 
4,965,199 to Capon et al . discloses a recombinant DNA method 
for producing factor VIII in mammalian host cells and 
purification of human factor VIII. Human factor VIII 
expression on CHO (Chinese hamster ovary) cells and BHKC (baby 
hamster kidney cells) has been reported. Human factor VIII 
has been modified to delete part or all of the B domain (U.S. 
Patent No. 4,868,112), and replacement of the human factor 
VIII B domain with the human factor V B domain has been 
attempted (U.S. Patent No. 5,004,803). The cDNA sequence 
encoding human factor VIII and predicted amino acid sequence 
are shown in SEQ ID NOs : 1 and 2, respectively.. 

Porcine factor VIII has been isolated and purified from 
plasma [Fass, D.N. et al . (1982) Blood 59. : 594] . Partial amino 

acid sequence of porcine factor VIII corresponding to portions 
of the N- terminal light chain sequence having homology to 
ceruloplasmin and coagulation factor V and largely incorrectly 
located were described by Church, et al . : (1984) Proc . Natl. 

Acad: Sci. USA 81:6934. Toole, J.J. et al . (1984) Nature 

312 :342-347 described the partial sequencing of the N-terminal 
end of four amino acid fragments of porcine factor VIII but 
did not characterize the fragments as to their positions in 
the factor VIII molecule." " The amino acid sequence of the B 
and part of the A2 domains of * porcine factor VIII were 
reported by Toole, J.J; et al . (1986) Proc. Natl. Acad. Sci, 

USA 83.: 593 9-594 2 . The cDNA sequence encoding the complete A2 

domain of porcine factor VIII and . predicted amino acid 
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sequence and hybrid human/porcine factor VIII having 
substitutions of all domains, all subunits, and specific amino 
acid sequences were disclosed in U.S. Serial No. 07/864,004 
entitled "Hybrid Human/Porcine factor VIII" filed April 7, 
19 92 by John S. Lollar and Marschall S. Runge, which issued as 
U.S. Patent No. 5,364,771 on November 15, 1994, and in WO 
93/20093. The cDNA sequence encoding the A2 domain of porcine 
factor VIII having sequence identity to residues 373-740 in 
mature human factor VIII, as shown in SEQ ID NO : 1 , and the 
predicted amino acid sequence are shown in SEQ ID NOs : 3 and 4, 
respectively. More recently, the nucleotide and corresponding 
amino acid sequences of the Al and A2 domains of porcine 
factor VIII and a chimeric factor VIII with porcine Al and/or 
A2 domains substituted for the corresponding human domains 
were reported in WO 94/11503. 

Both porcine and human factor VIII are isolated from 
plasma as a two subunit protein. The subunits, known as the 
heavy chain and light chain, are held together by a non- 
covalent bond that requires calcium or other divalent metal 
ions. The heavy chain of factor VIII contains three domains, 
Al, A2, and B, which are linked covalently. The light chain 
of factor VIII also contains three domains, designated A3, CI, 
and C2 . The B domain has no known biological function and-can 
be removed from the molecule proteolytically or by recombinant 
DNA technology methods without significant alteration in any 
measurable parameter of factor VIII. Human recombinant factor 
VIII has a similar structure and function to plasma-derived 
factor VIII, though it is not glycosylated unless expressed in 
mammalian cells. 

Both human and porcine activated factor VIII ("factor 
Villa") have three subunits due to cleavage of the heavy chain 
between the Al and A2 domains. This stmcture is designated 
A1/A2/A3-C1-C2 . Human factor Villa is not stable under the 
conditions that stabilize porcine factor Villa, presumably 
because of the weaker association of the A2 subunit of human 
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. factor Villa Dissociation of the A2 subunit of human and 
porcine factor Villa is associated with loss of activity in 
the factor Villa molecule. 

Using as probes the known sequence of parts of the 
porcine factor VIII molecule, the domains of the porcine 
factor VIII molecule that have not been sequenced to date can 
be sequenced by standard, established cloning techniques, such 
as those described in Weis, J.H. , "Construction of recombinant 
DNA libraries, " in Current Protocols in Molecular Biology , 
F.M. Ausubel et al . , eds . (1991); and Sambrook, J. , et al . , 
Molecular Cloning, A Laboratory Manual , so that full length 
hybrids can be constructed. 

Specifically provided as an exemplary and a . preferred 
embodiment is active recombinant hybrid human/porcine factor 
VIII having substituted A2 domain, the nucleic acid sequence 
encoding it, and the methods for preparing, isolating, and 
characterizing its activity. The methods by which this hybrid 
construct is prepared can also be used to prepare active 
recombinant hybrid . human/porcine, factor VIII or fragments 
thereof having substitution of. subunit s, continuous parts of 
domains, or domains other than A2 . One skilled in the art 
will recognize that these methods also demonstrate how other 
recombinant hybrid human /non- human, non-porcine mammalian or 
animal /animal hybrid factor VIII molecules or fragments 
thereof can be prepared in which subunits, domains, or 
continuous parts of domains are substituted. 

Recombinant hybrid human/porcine factor VIII is prepared 
starting with human cDNA (Biogen, Inc.) or porcine cDNA 
(described herein) encoding the relevant factor VIII sequence. 
In a preferred embodiment, the factor VIII encoded by the 
cDNA includes domains A1-A2-A3-C1-C2 , lacking- the entire B 
domain, and corresponds to amino acid residues 1-740' and 1649- 
23 3 2 of single chain human factor VIII (see SEQ ID NO: 2) , 
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according. to the numbering system of Wood et al. (1984) Nature 
312:330-337. 

Individual subunits, domains, or continuous parts of 
domains of porcine or human factor VIII cDNA can be and have 
been cloned and substituted for the corresponding human or 
porcine subunits, domains, or parts of domains by established 
mutagenesis techniques. For example, Lubin, I.M. et al . 
(1994) J". Biol. Chem. 269(12) :8639-8641 describes techniques 

for substituting the porcine .A2 domain for the human domain 
using convenient restriction sites. Other methods for 
substituting any arbitrary region of the factor VIII cDNA of 
one species for the factor VIII cDNA of another species 
include splicing by overlap extension ( " SOE") , as described by 
Horton, R.M. et al . (1993) Meth. Enzymol 217 : 270-279 . 

The hybrid factor VIII cDNA encoding subunits, domains, 
or parts of domains or the entire hybrid cDNA molecules are 
cloned into expression vectors for ultimate expression of 
active hybrid human/porcine factor VIII protein molecules in 
cultured cells by established techniques, as described by 
Selden, R.F., "Introduction of DNA into mammalian cells, "'in 
Current Protocols in Molecular Biology , F.M. Ausubel et al . , 
eds (1991) . 

In a preferred embodiment, a hybrid human/porcine cDNA 
encoding factor VIII, in which the porcine sequence encodes a 
domain or part domain, such as the A2 domain or part domain, 
is inserted in a mammalian expression vector, such as ReNeo, 
to form a hybrid factor VIII construct. Preliminary 
characterization of the hybrid factor VIII is accomplished by 
insertion of the hybrid cDNA into the ReNeo mammalian 
expression vector and transient expression of the hybrid 
protein in COS-7^ cells. A determination of whether active 
hybrid protein is expressed can then be made. The expression 
vector construct is used further to stably transfect cells in 
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culture, such as -baby hamster kidney cells, using methods that 
are routine in the art, such as liposome -mediated transfection 
(Lipof ectin™, Life Technologies, Inc.). ■ Expression of 
recombinant hybrid ' factor VIII protein can be confirmed, for 
example, by sequencing, Northern and Western blotting, or 
polymerase chain reaction (PCR) . Hybrid factor VIII protein 
in the culture media in which the transfected cells stably 
expressing the protein are maintained can be precipitated, 
pelleted, washed, and resuspended in an appropriate buffer, 
and the recombinant hybrid factor VIII protein purified by 
standard techniques, including immunoaf f inity chromatography 
using, for example, monoclonal anti-A2-Sepharose™ . 

In a further embodiment, the hybrid factor VIII 
comprising subunit , domain, or amino acid sequence 
substitutions is expressed as a fusion protein from a 
recombinant molecule in which sequence encoding a protein or 
peptide that enhances, for example, stability, secretion, 
detection, isolation, or the like is inserted in place 
adjacent to the factor VIII encoding sequence. Established 
protocols- for use of homologous or heterologous species 
expression control sequences including, for example, 
promoters, operators, and regulators, in the preparation of 
fusion proteins. are known and routinely used in. the art. See 
Current Protocols in Molecular Biology (Ausubel, F.M., et al. , 
eds) , Wiley Interscience , N.Y. 

The purified hybrid factor VIII or fragment thereof can 
be assayed for immunoreact ivity and coagulation activity by 
standard assays including, for example, the plasma- free factor 
VIII assay, the one -stage clotting assay, and the enzyme - 
linked immunosorbent assay using purified recombinant human 
factor VIII as a standard. 

Other vectors, including both plasmid and eukaryotic 
viral* vectors , may be used .to express a recombinant gene 
construct in eukaryotic cells depending on the preference and 
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judgment of the skilled practitioner (see, for example, 
Sambrook et al . , Chapter 16). Other vectors and expression 
systems, including bacterial, yeast, and insect cell systems, 
can be used but are not preferred due to differences in, or 
lack of, glycosylation. 

Recombinant hybrid factor VIII protein can be expressed 
in a variety of cells commonly used for culture and 
recombinant mammalian protein expression. In particular, a 
number of rodent cell lines have been found to be especially 
useful hosts for expression of large proteins. Preferred cell 
lines, available from the American Type Culture Collection, 
Rockville, MD, include baby hamster kidney cells, and Chinese 
hamster ovary (CHO) cells which are cultured using routine 
procedures and media. 

The same methods employed for preparing hybrid 
human/porcine factor VIII having subunit, domain, or amino 
acid sequence substitution can be used to prepare other 
recombinant hybrid factor VIII protein and fragments thereof 
and the nucleic acid sequences encoding these hybrids, such as 
human/non- human, non-porcine mammalian or animal /animal . 
Starting with primers from the known human DNA sequence, the 
murine and part of the porcine factor VIII cDNA have been 
cloned. Factor VIII sequences of other species for use in 
preparing a hybrid human/animal or animal/animal factor VIII 
molecule can be obtained using the known human and porcine DNA 
sequences as a starting point. Other techniques that can be 
employed include PCR amplification methods with animal tissue 
DNA, and use of a cDNA library from the animal to clone out 
the factor VIII sequence. 

As an exemplary embodiment, hybrid human/mouse factor 
VIII protein can be made as follows. DNA clones corresponding 
to the mouse homolog of the human factor VIII gene have been 
isolated and sequenced and the amino acid sequence of mouse 
factor VIII protein predicted, as described in Elder, G., et 
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al. (1993) Genomics 16 (2) : 374-379 .which also includes a 

comparison of the predicted amino acid sequences of mouse, 
human, and part of porcine factor VIII molecules. The mouse 
factor VIII cDNA sequence and predicted amino acid sequence 
are shown in SEQ ID NO: 5 and SEQ ID NO : 8 , respectively. In a 
preferred embodiment, the RNA amplification with transcript 
sequencing (RAWTS) methods described in Sarkar, G. et al . 
(1989) Science 244 : 331-334 , can be used. Briefly, the steps 
are (1) cDNA synthesis with oligo (dT) or an mRNA-specif ic 
oligonucleotide primer; (2) polymerase chain reaction (PCR) in 
which one or both oligonucleotides contains a phage promoter 
attached to a sequence complementary to the region to be 
amplified; (3) transcription with a phage promoter; and (4) 
reverse transcriptase-mediated dideoxy sequencing of the 
transcript, which is primed with a nested (internal) 
oligonucleotide. In addition to revealing sequence 

information, this method can generate an in vitro translation 

product by incorporating a translation initiation signal into 
the appropriate PCR primer: and can be used to obtain novel 
mRNA sequence information from other species. 

Substitution of amino acid(s) : 

The present invention provides active recombinant hybrid 
human/animal and animal/animal . factor VIII molecules or 
fragments thereof comprising at least one sequence including 
one or more unique amino acids of one species substituted for 
the corresponding amino acid sequence of the other species or 
fragments thereof, nucleic acid sequences encoding these 
hybrids, methods for preparing and isolating them, and methods 
for characterizing their coagulant, immunogenic and 
immunoreactive properties. 

The A2 domain is necessary for the procoagulant activity 
of the factor VIII molecule. Studies show that porcine factor 
VIII has six-fold greater procoagulant activity than human 
factor VIII (Lollar, P. et al . (1991)' J. Biol. Chem. 
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266 : 12481-12486 . 'and that the difference in coagulant activity 
between human and porcine factor VIII appears to be based on 
a difference in amino acid sequence between one or more 
residues in the human and porcine A2 domains (Lollar, P. et 
al. (1992) J. Biol. Cham. 267:23652-23657. Further, the A2 

and C2 domains and possibly a third light chain region in the 
human factor VIII molecule are thought to harbor the epitopes 
to which most, if not all, inhibitory antibodies react, 
according to Hoyer (1994) Semin. Hewatol . 31:1-5. 

Recombinant hybrid human/animal, animal /animal , or 
equivalent factor VIII molecules or fragments thereof can be 
made by substitution of at least one specific sequence 
including one or more unique amino acids from the A2 , C2 , 
and/or other domains of the factor VIII of one species for the 
corresponding sequence of the other species, wherein the amino 
acid sequences differ, as illustrated in more detail below, 
between the molecules of the two species. In an exemplary 
preferred embodiment described herein, the present invention 
provides active recombinant hybrid human/porcine factor VIII 
comprising porcine amino acid sequence substituted for 
corresponding human amino acid sequence that includes an 
epitope, wherein the hybrid factor VIII has decreased or no 
immunoreactivity with inhibitory antibodies to factor VIII. 
In a further embodiment, active recombinant hybrid factor VIII 
molecules can also be made comprising amino acid sequence from 
more than one species substituted for the corresponding 
sequence in a third species. Recombinant hybrid equivalent 
molecules can also be made, comprising human, animal, or 
hybrid factor VIII including at least one sequence including 
one or more amino acids that have no known sequence identity 
to factor VIII, as further described below. 

Any hybrid factor VIII construct having specific amino 
acid substitution as described can be assayed by standard 
procedures for coagulant activity -and for reactivity with 
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inhibitory antibodies . to. factor VIII. for identification of 
hybrid factor VIII molecules with enhanced coagulant activity 
and/or decreased antibody immunoreactivity . Hybrid molecules 
may also be identified that have reduced coagulant activity 
compared to. human or porcine factor VIII but also have 
decreased antibody reactivity. One skilled in the art will 
recognize that hybrid factor VIII molecules or fragments 
thereof having less, equal, or greater coagulant activity, 
compared to human or porcine factor VIII, is useful for 
treating patients who have a factor VIII deficiency. The 
methods described herein to prepare active recombinant hybrid 
human/porcine factor VIII with substitution of specific amino 
acids can be used to prepare active recombinant hybrid 
human/non- human, non-porcine mammalian factor VIII protein, 
hybrid animal-l/animal-2 factor VIII, and hybrid equivalent 
factor VIII or fragments thereof. 

Hybrid factor VIII molecules with altered coagulant 
activity: 

The present invention provides procoagulant recombinant 
hybrid human/animal, animal/animal, or equivalent factor VIII 
molecules or fragments thereof comprising at least one 
specific sequence including one or more unique amino acids 
having procoagulant activity in the factor VIII of one species 
substituted for the corresponding amino acid sequence of the 
factor VIII of the other species, using established site- 
directed mutagenesis techniques as described herein . The 
specific sequences to be used in the substitution are selected 
and the hybrid constructs are prepared and assayed for 
coagulant activity, as follows. Specifically provided as a 
preferred and exemplary embodiment is a hybrid human/porcine 
factor VIII comprising amino acid substitutions in the A2 
domain. It is understood* that one skilled in the art. can use 
these methods to prepare other hybrid human/animal, 
animal/animal, and equivalent, factor VIII . molecules or 
fragments thereof .having, altered coagulant activity. 
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preferably increased coagulant activity compared to human 
factor VIII. 

The basis for the greater coagulant activity in porcine 
5 factor VIII appears to be the more rapid spontaneous 

dissociation of the A2 subunit of human factor Villa than 
porcine factor Villa, which leads to loss of activity, 
according to Lollar, P. et al . (1990) J. Biol. Chejn. 265 : 1688- 

1692; Lollar, P. et al . (1992) J. Biol. Chen?. 267 : 23652- 

10 23657; Fay, P.J. et al . (1992) J". Biol. Chem. 267 : 13246-13250 . 

A comparison of the alignment of the amino acid sequences 
of the human and porcine factor VIII A2 domains (residue 
numbering starts at position 373 with respect to the full 

15 length amino acid sequence of human factor VIII, SEQ ID NO: 2) 

is shown in Figure 1C. For preparation of a hybrid 
human/porcine factor VIII molecule with altered coagulant 
activity, the initial target candidates for mutagenesis, which 
were revealed upon comparison of the human and porcine A2 

20 amino acid sequences (SEQ ID NOs : 2 and 6, respectively) 

within the human A2 domain, are shown in Table I.. 

TABLE I. HUMAN AMINO ACID SEQUENCE TARGET CANDIDATES 
FOR MUTAGENESIS (SEQ ID NO: 2) 
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Sequence Residues Mismatches Charge 

Changes 



398-403 6 4 .1 

30 434-444 10 4 3 

484-496 13 7 3 

598-603 6 4 2 

536-541 6 4 0 

713-722 10 6 2 

35 : 727-737 11 6 2 

Table I and the bold letters of Figures 1A-1B illustrate 
seven sequences in the human and pig A2 domain amino acid 
sequences (SEQ ID NOs : 2 and 6, respectively) that constitute 
40 only 17 percent of the A2 domain but include 7 0 percent of the 

sequence differences between human and porcine A2 domains. 
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- A recombinant hybrid human/porcine construct is- described 
in which amino acids Ser373 -Glu604 in the A2 domain (Ser373- 
Arg740) of human factor VIII have been replaced with the 
homologous porcine sequence . This construct does not react 
with A2 inhibitors and has the same coagulant activity as 
human B(-) factor VIII. A plasma -derived hybrid molecule is 
described that comprises a complete porcine A2 domain 
substitution in the human factor VIII that has increased 
coagulant activity compared to human factor VIII. Comparison 
of these constructs indicates that a region between residues 
Asp605 and Arg740 is responsible for the difference in 
activity between human and porcine factor VIII. This region 
can be defined more specifically by systematically making 
recombinant hybrid human/porcine factor VIII molecules with 
porcine substitutions in the region between Asp605 and Arg740 
by using established site-directed mutagenesis techniques, for 
example, the "splicing by overlap extension" (SOE) method that 
has been used extensively to make hybrid factor VIII molecules 
containing porcine substitutions in the NH 2 -terminal region of 
A2 . These molecules can be expressed in COS-7 cells and baby 
hamster kidney cells as described above. They can be purified 
to homogeneity using methods known in the art, such as 
heparin- Sepharose™ and immunoaf f inity chromatography. Protein 
concentration can be estimated . by absorption of ultraviolet 
light at A 280 / and the specific activity of the constructs can 
be determined by dividing coagulant activity (measured in 
units per ml by single stage clotting assay) by A 28 o • Human 
factor VIII has a specific activity of approximately 3000-4000 
U/A 280 / whereas porcine factor VIII has a specific activity of 
approximately 20/000 U/A 280 - In a preferred embodiment, the 
procoagulant recombinant hybrid human/porcine factor VIII has 
a specific activity of -20,000 U/A 280 and contains a minimal 
amount of porcine substitution in the -A2 domain. 

As described herein, site-directed mutagenesis techniques 
are used to identify hybrid protein with coagulant activity 
that can be enhanced; equal to, or reduced, compared to human 
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factor VIII, . but preferably is enhanced. In the hybrid 
human/porcine embodiment, specific human sequences are 
replaced with porcine sequences, preferably using the splicing 
by overlap extension method (SOE) , as described by Ho, S.N., 
et al . , 77 Gene 51-59 (1994), and in Examples 7 and 8. 
Oligonucleotide-directed mutagenesis can also be used, as was 
done to loop out the amino acid sequence for part of the human 
A2 domain (see Example 7) . As functional analysis of the 
hybrids reveals coagulant activity, the sequence can be 
further dissected and mapped for procoagulant sequence by 
standard point mutation analysis techniques. 

The present invention contemplates that hybrid factor 
VIII cDNA and protein can be characterized by methods that are 
established and routine, such as DNA sequencing, coagulant 
activity assays, mass by ELISA and by UV absorbance at 280 nm 
of purified hybrid factor VIII, specific coagulant activity 
(U/mg) , SDS-PAGE of purified hybrid factor VIII, and the like. 
Other known methods of testing for clinical effectiveness may 
be required, such as amino acid, carbohydrate, sulfate, or 
metal ion analysis. ^ 

A recombinant hybrid factor VIII having superior 
coagulant activity, compared to human factor VIII, may be less 
expensive to make than plasma -derived factor VIII and may 
decrease the amount of factor VIII required for effective 
treatment of factor VIII deficiency. 

Hybrid factor VIII molecules with reduced 

immunoreact ivi tv : 

Epitopes that are immunoreact ive with antibodies that 
inhibit the coagulant activity of factor VIII ("inhibitors" or 
" inhibitory antibodies") have been characterized based on 
known structure- function relationships' in factor VIII. 
Presumably, inhibitors could act by disrupting any of the 
macromolecular . interactions associated, with the domain 
structure of factor VIII or its associations with von 
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Willebrand factor, thrombin, factor Xa, factor IXa, or factor 
X. However, over 90 percent of inhibitory antibodies to human 
factor VIII act by binding to epitopes located in the 40 kDa 
A2 domain or 20 kDa C2 domain of factor VIII, disrupting 
specific functions associated with these domains, as described 
by Fulcher etal. (1985) Proc . Natl. Acad. Sci USA 82 : 7728- 

7732; and Scandella et al . (1988) Proc. Natl. Acad. Sci. USA 
J35: 6152-6156 . In addition to the A2 and C2 epitopes, there 
may be a third epitope in the A3 or CI domain of the light 
chain of factor VIII, according to Scandella et al . (1993) 
Blood 82.: 1767-1775 . The significance of this putative third 
epitope is unknown, but it appears to account for a minor 
fraction of the epitope reactivity in factor VIII. 

Anti-A2 antibodies block factor X activation, as shown by 
Lollar et al . (1994) J. Clin. Invest. 93.: 2497-2504 . Previous 
mapping studies by deletion mutagenesis described by Ware et 
al . (1992) Blood Coagul . Fibrinolysis 3: 703-716, located the 
A2 epitope to within a 20 kDa region of the NH 2 - terminal end 
of the 4 0 kDa A2 domain. Competition immunoradiome trie assays 
have indicated that A2 inhibitors recognize either a common 
epitope or narrowly clustered epitopes, as described by 
Scandella et al . (1992) Throm. Haemostas 67.: 665-671 , and as 

demonstrated in Example 8 . 

The present invention provides active recombinant hybrid 
and hybrid equivalent factor VIII molecules or fragments 
thereof, the nucleic acid sequences encoding these hybrids, 
methods of preparing and isolating them, and methods for 
characterizing them. These hybrids comprise human/animal , 
animal /animal, or equivalent hybrid factor VIII molecules, 
further comprising at least one specific amino acid sequence 
including one or more unique amino acids of the factor VIII of 
one species substituted for the corresponding amino acid 
sequence of the factor' VIII of the other species; or comprises 
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at least one . sequence including one or more amino acids having 
no known sequence identity to factor VIII substituted for 
specific amino acid sequence in human, animal, or hybrid 
faccor VIII. The resulting hybrid factor VIII has reduced or 
no immunoreactivity to factor VIII inhibitory antibodies, 
compared to human or porcine factor VIII. 

Using the approach described in the previous section for 
substitution of amino acids in the factor VIII molecule, 
mutational analysis is employed to select corresponding factor 
VIII amino acid sequence of one species, preferably porcine, 
which is substituted for at least one sequence including one 
or more amino acids in the factor VIII of another species, 
preferably human, or for amino acid sequence of a hybrid 
equivalent factor VIII molecule, that includes one or more 
critical region (s) in the A2 , C2 , or any other domain to which 
inhibitory antibodies are directed. The methods are described 
: in more detail below. The resulting procoagulant recombinant 
-hybrid construct has reduced or no immunoreactivity 'to 
inhibitory antibodies, compared to human factor VIII, using 
■standard assays. Through systematic substitution ^pf 

-increasingly smaller amino acid sequences followed by assay-of 
the hybrid construct for immunoreactivity, as described below, 
the epitope in any domain of a factor VIII molecule is mapped, 
substituted by amino acid sequence having less or no 
immunoreactivity, and a hybrid factor VIII is prepared. 

It is understood that one skilled in the art can use this 
approach combining epitope mapping, construction of hybrid 
factor VIII molecules, and mutational analysis of the 
constructs to identify and replace at least one sequence 
including one or more amino acids comprising an epitope in the 
A2, C2, and/or other domains to which inhibitory antibodies 
are directed and to construct procoagulant recombinant hybrid 
human/animal, animal/animal, or equivalent factor VIII or 
fragments thereof having decreased or no immunoreactivity 
compared to human or porcine factor VIII. This approach is 
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used/, as described in Example 8 , /to prepare a recombinant 
procoaguiant hybrid human/porcine factor VIII having porcine 
amino acid substitutions in the human A2 domain and no 
antigenicity to anti- factor VIII antibodies as an exemplary 
5 embodiment . . 

Usually, porcine factor VIII has limited or no reaction 
with inhibitory antibodies, to human factor VIII. The 
recombinant hybrid human/porcine factor VIII molecules having 

10 decreased or no reactivity with inhibitory antibodies based on 

amino acid substitution in the A2 domain are prepared, as an 
example of how hybrid factor VIII can be prepared using the 
factor VIII of other species and substitutions in domains 
other than A2 , as follows. The porcine A2 domain is cloned by 

15 standard cloning techniques, such as those described above and 

in Examples 6, 7, and 8, and then cut and spliced within the 
A2 domain using routine procedures, such as using restriction 
sites to cut the cDNA or splicing by overlap extension (SOE) . 
The resulting porcine amino acid sequence is substituted into 

20 the human A2 . domain to form a hybrid factor VIII . construct , 

which is *■ inserted into a mammalian expression vector, 
preferably ReNeo, stably transfected into cultured cells, 
preferably baby hamster kidney cells, and expressed, as 
described above. The hybrid factor VIII is assayed for 

25 immunoreactivity, for example with anti-A2 antibodies by the 

routine Bethesda assay or by plasma- free chromogenic substrate 
assay. The Bethesda unit (BU) is the standard method for 
measuring inhibitor titers. If the Bethesda titer is not 
measurable (<0.7 BU/mg IgG) in the hybrid, then a human A2 

30 epitope was eliminated in the region of substituted 

corresponding porcine sequence.. The epitope is progressively 
narrowed, and the specific A2 epitope can thus be determined 
to produce a hybrid human/porcine molecule with as little 
porcine sequence as possible. As described herein, a 25- 

35 . residue sequence corresponding to amino acids Arg484-Ile508 

that is critical for inhibitory immunoreactivity has been 
identified and substituted in the human A2 domain. Within 
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this sequence . are only nine differences between human and 
porcine factor VIII. This region can be further analyzed and 
substituted . 

Hybrid human/porcine factor VIII molecules having 
decreased or no reactivity with inhibitory antibodies based on 
substitution of amino acid sequence in the CI, C2 or other 
domain, with or without substitution in the A2 domain, can 
also be prepared. The C2 epitope, for example can be mapped 
using the homolog scanning approach combined with site- 
directed mutagensesis . More specifically, the procedures can 
be the same or similar to those described herein for amino 
acids substitution in the A2 domain, including cloning the 
porcine C2 or other domain, for example by using RT-PCR or by 
probing a porcine liver cDNA library with human C2 or other 
domain DNA; restriction site techniques and/or successive SOE 
to map and simultaneously replace epitopes in the C2 or other 
domain; substitution for the human C2 or other domain in B(-) 
factor VIII; insertion into an expression vector, such as 
pBluescript; expression in cultured cells; and routine assay 
for immunoreactivity . For the assays, the reactivity of-C2 
hybrid factor VIII with a C2 -specific inhibitor, MR [Scandella 
et al . (1992) Thomb. Haemostasis 67:665-671 and Lubin et al . 

(1994)], and/or other G2 specific antibodies prepared ^by 

affinity chromatography can be performed. 

The C2 domain consists of amino acid residues 2173-2332 
(SEQ ID NO: 2) . Within this 154 amino acid region, inhibitor 
activity appears to be directed to a 65 amino acid region 
between residues 2248 and 2312, according to Shima, M. et al . 
(1993) Throwb. Haemostas £9:240-246. If the C2 sequence of 
human and porcine factor VIII is approximately 85 percent 
identical in this region, as it is elsewhere in the 
functionally active regions of factor VIII, there will be 
approximately ten differences between human and porcine factor 
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VIII C2 amino acid sequence , which ' can be used as initial 
targets to construct hybrids with substituted C2 sequence. 

It is likely that clinically significant factor VIII 
epitopes are confined to the A2 and C2 domains. However, if 
antibodies to other regions (Al, A3, B, or CI domains) of 
factor VIII are identified, the epitopes can be mapped and 
eliminated by using the approach described herein for the 
nonantigenic hybrid human/porcine factor VIII molecules. 

More specifically, mapping of the putative second light 
chain epitope and/or- any other epitope in any other animal or 
human factor VIII domain can also be accomplished. Initially, 
determination of the presence of a third inhibitor epitope in 
the A3 or CI domains can be made as follows. Using human 
("H") and porcine (»p») factor VIII amino acid sequences as a 
model, Al p -A2 p -A3 p -C1 H -C2p and Al p -A2 p -A3 H -Cl p -C2 p B-domainless 
hybrids will be constructed. Inhibitor /igG from approximately 
20 patient plasmas (from Dr. Dorothea Scandella, American Red 
Cross) who have low or undetectable titers against porcine 
factor VIII will be tested against" the hybrids. If -the third 
epitope is in the A3 domain, inhibitory IgG is expected to 
react with Al p -A2 p -A3„-Cl p -C2 p but not Al p -A2 p -A3 p -C1 H -C2 p . 
Conversely, if the third epitope is in the CI domain, then 
inhibitory IgG is expected to react with Al p -A2 p -A3 p -C1 H -C2 p but 
not Al p -A2 p -A3 H -Cl p -C2 p . If a third epitope ' is identified it 
will be characterized by the procedures described herein for 
the A2 and C2 epitopes . 

For example, antibodies specific for the CI or A3 domain 
epitope can be isolated from total patient IgG by affinity 
chromatography using the Al p -A2 p -A3 H -Cl p -C2 p and A1 P -A2 P -A3 P -C1 H - 
C2 P hybrids, and ,by elimination of C2 specific antibodies by 
passage over recombinant factor VIII C2-Sepharadse™ . The 
putative third epitope will be identified by SOE constructs in 
which, • in a- preferred embodiment, portions of the human factor 
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VIII A3 or CI domain are systematically replaced with porcine 
sequence. 

Hybrid factor VIII molecules with reduced immunoqenicity : 
A molecule is immunogenic when it can induce the 
production of antibodies in human or animal. The present 
invention provides a procoagulant recombinant hybrid 
human/animal or animal/animal factor VIII molecule, hybrid 
factor VIII equivalent molecule, or fragment of either that is 
less immunogenic than wild- type human porcine factor VIII in 
human or animal, comprising at least one specific amino acid 
sequence including one or more unique amino acids of the 
factor VIII of one species substituted for the corresponding 
amino acid sequence that has immunogenic activity of the 
factor VIII of the other species; or at least one amino acid 
sequence including one or more amino acids having no known 
identity to factor VIII substituted for amino acid sequence, of 
the human, animal, or hybrid factor. This hybrid can be used 
to lower the incidence of inhibitor development in an animal 
or human and to treat factor VIII deficiency, and would be 
preferred in treating previously untreated patients w^Lth 
hemophilia. In a preferred embodiment, the hybrid factor VIII 
comprises human factor VIII amino acid sequence, further 
comprising one or more alanine residues substituted for human, 
amino acid sequence having immunogenic activity, resulting In 
a procoagulant recombinant hybrid equivalent molecule or 
fragment thereof having reduced or no immunogenicity in human 
or animal . 

The process described herein of epitope mapping and 
mutational analysis combined with substitution of non- 
ant igenic amino acid sequence in a factor VIII molecule, using 
hybrid human/porcine factor VIII, produces hybrid molecules 
with low antigenicity. Using this" model and the associated 
methods, any of the hybrid constructs described herein can be 
altered by site-directed mutagenesis techniques to remove as 
much of any functional epitope as possible to minimize the 
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ability of the immune system to recognize the hybrid factor 
VIII, thereby decreasing its immunogenicity . 

One method that can be used to further reduce the 
antigenicity and to construct a less immunogenic hybrid factor 
VIII is alanine scanning mutagenesis, described by Cunningham, 
B.C. et al. (1989) Science 244 : 1081-1085, of selected specific 
amino acid sequences in human, animal , or hybrid equivalent 
factor VIII. In alanine scanning mutagenesis, amino acid side 
chains that are putatively involved in an epitope are replaced 
by alanine residues by using site-directed mutagenesis. By 
comparing antibody binding of alanine mutants to wild- type 
protein, the relative contribution of individual side chains 
to the binding interaction can be determined. Alanine 
substitutions are likely to be especially useful, since side 
chain contributions to antibody binding are eliminated beyond 
the p carbon, but, unlike glycine substitution ,* main chain 
conformation is not usually altered. Alanine substitution 
does not impose major steric, hydrophobic or electrostatic 
effects that dominate protein-protein interactions. 

In protein antigen-antibody interactions, there usually 
are about 15-20 antigen side chains in contact with the 
antibody.- Side chain interactions, as opposed to main chain 
interactions, dominate protein-protein interactions. Recent 
studies have suggested that only a few (approximately 3 to 5) 
of these side chain' interactions contribute most of the 
binding energy. See Clackson, T. et al . (1995) Science 

2.67:383-3 86 • An extensive analysis of growth hormone epitopes 
for several murine monoclonal antibodies revealed the 
following hierarchy for side chain contributions to the 
binding energy: Arg > Pro > Glu - Asp - ?he -lie, with Trp, 
Ala, Gly, and Cys not' tested [Jin/'L. et al. (1992) J. Mol . 
Biol, 226 i 851-865] . Results with the A2 epitope described 
herein are consistent with . this , since, twelve of the 25 
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residues in the 484-508 A2 segment contain these side chains 
(Table 1) . 

The finding that certain amino acid residues are 
particularly well recognized by antibodies, indicates that 
elimination of these residues from a known epitope can 
decrease the ability of the immune system to recognize these 
epitopes, i.e., can make a molecule less immunogenic. In the 
case of the A2 epitope, immunogenic residues can be replaced 
without loss of factor VIII coagulant activity. For example, 
in HP9 , Arg484 is replaced by Ser, Pro485 is replaced by Ala, 
Arg489 is replaced by Gly, Pro492 is replaced by Leu, and 
PheBOl is replaced by Met. Further, results from the patient 
plasmas used to test immunoreactivity in hybrid human/porcine 
factor VIII constructs, described in Example 8, indicate that 
antibodies from different patients recognize the same or a 
very similar structural region in the A2 domain and that the 
residues in the A2 domain that participate in binding A2 
inhibitors appear to show little variation. Thus, the A2 
epitope included in human factor VIII residues 484-508 is an 
immunodominant epitope in that it is recognized by the human 
immune system better than other structural regions of factor 
VIII. Replacing this structure by nonantigenic factor VIII 
sequence from another species or by non- factor VIII amino acid 
sequence, while retaining full procoagulant activity, is 
expected to alter recognition of hybrid or hybrid equivalent 
factor. VIII by the immune system. 

It is anticipated that site-directed mutagenesis to 
replace bulky and/or charged residues that tend to dominate 
epitopes with small, neutral side chains (e.g., alanine) may 
produce a less immunogenic region. It is expected that a 
molecule containing a few of these substitutions at each 
significant inhibitor epitope will be difficult for the immune 
system to fit by the lock-and-key mechanism that is typical of 
antigen-antibody interactions. Because of its low 

antigenicity, such a hybrid molecule could be useful in 
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treating factor. VIII deficiency patients with inhibitors, and 
because .of its low immunogenicity / it could be useful in 
treating previously untreated patients with hemophilia A. 

A general result is that mutation of one of a few key 
residues is sufficient to decrease the binding constant for a 
given protein-protein interaction by several orders of 
magnitude. Thus, it appears likely that all factor VIII 
epitopes contain a limited number of amino acids that are 
critical for inhibitor development. For each epitope in 
factor VIII, alanine substitutions for at least one sequence 
including one or more specific amino acids having immunogenic 
activity, may produce an active molecule, that is less 
immunogenic than wild-type factor VIII. In a preferred 
embodiment, the hybrid factor VIII is B-domainless . 

The methods for preparing active recombinant hybrid or 
hybrid equivalent factor VIII with substitution of amino acid 
sequence having little or no. immunogenic activity for amino 
acid sequence in the factor VIII having immunogenic activity 
are- as follows, using hybrid human/porcine: factor VIII with 
amino acid substitutions in the A2 domain as an exemplary 
embodiment. There are 25 residues in the human factor VIII 
region 484-508;. .Site-directed mutagenesis can be used to make 
single mutants in which any of these residues is replaced by 
any of the other 19 amino acids for a total of 475 mutants. 
Furthermore, hybrid molecules having more, than one mutation 
can be constructed. 

The hybrid constructs can be assayed for antigenicity by 
measuring .the binding constant for inhibitor antibodies, as 
described by Friguet, B. et..al. (1985) J". Immunol . Methods 

77:305-319 (1985). In a preferred embodiment, the binding 
constant will be reduced by at least three orders of 
magnitude, which would, lower the^ Bethesda titer to a level 
that is clinically insignificant. For example, the IC 50 (a 
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crude measure: of the binding constant) of inhibition by A2 
antibodies was reduced in hybrid human/porcine factor VIII 
constructs HP2 , HP4 , HPS, HP7 , and HP9 , described in Example 
S, and this was associated with a reduction in Bethesda titer 
to an unmeasurable level. It is anticipated, for example, 
that a double or triple alanine mutant of human factor VIII 
(e.g. a .human factor VIII Arg484->Ala, Arg489->Ala / PheSOl- 

>Ala triple mutant) will produce a molecule with sufficiently 
low antigenicity for therapeutic use. Similar mutations can 
be made in the C2 epitope and the putative third epitope. A 
preferred embodiment comprises two or . three alanine 
substitutions into two or three factor VIII epitopes. Other 
substitutions into these regions can also be done. 

In a preferred embodiment, hybrid equivalent factor VIII 
molecules will be identified that are less antigenic and/or 
immunogenic in human and animal than either human or porcine 
factor VIII. Such hybrid equivalent constructs can- be tested 
in animals for their reduced antigenicity and/or 
immunogenicity . For example, control and factor VIII 

deficient rabbits, pigs, dogs, mice, primates, and other 
mammals can be used as animal models. In one experimental 
protocol, the hybrid or hybrid equivalent factor VIII can be 
administered systematically over a period of six months to one 
year to the animal, preferably by intravenous infusion, and in 
a dosage range between 5 and 50 Units/kg body weight, 
preferably 10-50 Units/kg, and most preferably 40 Units/kg 
body weight. Antibodies can be measured in plasma samples 
taken at intervals after the infusions over the duration of 
the testing period by routine methods, including immunoassay 
and the Bethesda assay. Coagulant activity can also be 
measured in samples with routine procedures, including a one- 
stage coagulation assay. 

The hybrid - equivalent factor VIII molecules can be tested 
in humans for their reduced antigenicity and/or immunogenicity 
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in at least two types of clinical trials: In one type of 
trial., .designed to determine whether the hybrid or hybrid 
equivalent factor . VIII is immunoreactive with inhibitory 
antibodies, hybrid or hybrid equivalent factor VIII is 
administered, preferably by intravenous infusion, to 
approximately 25 patients having factor VIII deficiency who 
have antibodies to factor VIII that inhibit the coagulant 
activity of therapeutic human or porcine factor VIII. The 
dosage of the hybrid or hybrid equivalent factor VIII is in a 
range between 5 and 50 Units/kg body weight, preferably 10-50 
Units/kg, and most preferably 40 Units/kg body weight. 
Approximately 1 hour after each administration, the recovery 
of factor VIII from, blood samples is measured in a one-stage 
coagulation assay. Samples are taken again approximately 5 
hours after infusion, and recovery is measured. Total 
recovery and the rate of disappearance of factor VIII from the 
samples is predictive of the antibody titer and inhibitory 
activity. If the antibody titer is high, factor VIII recovery 
usually cannot be measured. The recovery results are compared 
to the recovery of recovery results in patients treated with 
plasma-derived human factor VIII, recombinant human factor 
VIII,. porcine factor VIII, and other commonly used therapeutic 
forms of factor VIII or factor VIII substitutes. 

In a second type of clinical trial, designed to determine 
whether the, hybrid or hybrid equivalent factor VIII is 
immunogenic, i.e., whether patients will develop inhibitory 
antibodies, hybrid or hybrid equivalent factor VIII is 
administered, as described in the preceding paragraph, to 
approximately 100 previously untreated hemophiliac patients 
who have not developed antibodies to factor VIII. Treatments 
are given approximately every 2 weeks over a period of 6 
months, -to l year. At 1 to 3 month intervals during this 
period, blood samples are drawn and Bethesda assays or other 
antibody assays are performed to determine the presence of 
inhibitory antibodies. Recovery assays can also be done, as 
described above, after each infusion. Results are compared to 
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hemophiliac patients who receive plasma-derived human factor 
VIII, recombinant human factor VIII, porcine factor VIII, or 
other commonly used therapeutic forms of factor VIII or factor 
VIII substitutes. 

Preparation of hybrid factor VIII molecules using human 
and non-porcine, non-human mammalian factor VIII amino 
acid sequence : 

The methods used to prepare hybrid human/porcine factor 
VIII with substitution of specific amino acids can be used to 
prepare recombinant hybrid human /non- human, non-porcine 
mammalian or animal/animal factor VIII protein that has, 
compared to human or porcine factor VIII, altered or the same 
coagulant activity and/or equal or reduced immunoreactivity 
and/or immunogenicity, based on substitution of one or more 
amino acids in the A2 , C2 , and/or other domains . 

Similar comparisons of amino acid sequence identity can 
be made between human and non-human, non-porcine mammalian 
factor VIII proteins to determine the amino acid sequences in 
which procoagulant activity, anti-A2 and anti-C2 
immunoreactivity, and or immunogenicity , or immunoreactivity 
and/or immunogenicity in other domains reside. Similar 
methods can then be used to prepare hybrid human /non -human, 
non-porcine mammalian factor VIII molecules. As described 
above, functional analysis of each hybrid will reveal those 
with decreased reactivity to inhibitory antibodies, and/or 
reduced immunogenicity, and/or increased coagulant activity, 
and the sequence can be further dissected by point mutation 
analysis. 

For example, hybrid human/mouse factor VIII molecules can 
be prepared as described above. The amino acid sequence 
alignment* of the A2 domain of human (SEQ ID NO: 2) and mouse 
(SEQ ID NO: 6) is shown in Figure 1C. As reported by Elder et 
al./ the factor VIII protein encoded by the mouse cDNA (SEQ ID 
NO: 5) has 2319 amino acids, with" 74% sequence identity overall 
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to' the human sequence. (SEQ ID NO : 2 ), --£87. percent identity when 
the B . domain is excluded from the comparison) , and is 32 amino 
acids - shorter than human factor . VIII. . The amino acid 
sequences in the- mouse A and C domains ( SEQ. ID NO : 6 ) are 
highly conserved, with 84-93 percent sequence identity to the 
human sequence (SEQ ID NO: 2)/ while the B and the two short 
acidic domains .. have 42-70 percent sequence identity. 
Specifically, the Al , A2 , and A3 mouse amino acid sequences 
(SEQ ID NO: 6) are 85, 85, and 90 percent identical to the 
corresponding human amino acid sequences (SEQ ID NO: 2) . The 
CI and C2 mouse amino acid sequences are 93 and 84 percent 
identical to the corresponding human amino acid sequences. In 
the predicted- mouse factor .VIII amino acid sequence (SEQ .ID 
NO: 6) , - the Al, A2 , and A3 domains are homologous to human 
factor VIII amino acids 1-372, 373-740, and 1690-2032, 
respectively, using amino acid sequence identity for numbering 
purposes. 

The thrombin/f actor Xa and all but one activated protein 
G cleavage sites are conserved in mouse factor VIII. The 
tyrosine residue for von Willebrand factor binding is also 
conserved. 

According to Elder et. al., the nucleotide sequence (SEQ 
IDNO:5) of mouse factor VIII contains 7519 bases and has 67 
percent identity overall with .the human nucleotide sequence 
(SEQ ID NO: 1) . The 6957 base pairs of murine coding sequence 
have 82 percent sequence identity with the 7053 base pairs of 
coding sequence in human factor VIII. When the B domain is 
not included in the comparison, there is an .88 percent 
nucleotide sequence identity. 

■ Elder et al. report that . human and mouse factor VIII 
molecules are 74 percent identical overall, and that 95 
percent of the human residues .that lead to -hemophilia when 
altered are identical- in the mouse. , These data support the 
application of the same techniques used to identify amino acid 

. 44 



9S4£274A1_I 



WO 99/46274 



PCT/US99/05193 



sequence with coagulant activity and/or immunoreactivity to 
antibodies in the porcine factor VIII molecule to the mouse or 
other animal factor VIII to identify similar amino acid 
sequences and prepare hybrid molecules. 

Preparation of hybrid factor VIII molecules having 
reduced cross-reactivity using human and non-human, non- 
porcine mammalian factor VIII amino acid sequence and 
non-factor VIII amino acid sequence: 

Porcine factor VIII is used clinically to treat factor 
VIII deficiency patients who have inhibitory antibodies to 
human, factor VIII. Cross-reactivity, in which human plasma 
reacts with porcine factor VIII , can be reduced by preparation 
of hybrid porcine/non-human, non-porcine mammalian or hybrid 
equivalent factor VIII. In a preferred embodiment, a 

determination of whether human A2 , C2 , or other domain- 
specific inhibitors react with non-human, non-porcine 
mammalian ("other mammalian") factor VIII is made, using the 
routine Bethesda assay and the particular other mammalian 
plasma as the standard. Inhibitor titers are usually measured 
in plasma, so purified other mammalian factor VIII is not 
necessary. If the inhibitors do not react with the other 
mammalian factor VIII, such as murine factor VIII, the 
sequence of which is known, then corresponding other mammalian 
sequence can be substituted into the porcine epitope region, 
as identified by using human/porcine hybrids. Once the animal 
sequence is known, site directed mutagenesis techniques, such 
as oligonucleotide-mediated mutagenesis described by Kunkel, 
T.A. et al. (1991) Meth. Enzymol 204 : 125-139, can be used to 

prepare the hybrid porcine/animal factor VIII molecule. If 
other animal plasmas are less reactive with A2 , C2 , or other 
factor VIII inhibitors than murine or porcine factor VIII, the 
animal sequence corresponding to the porcine epitope can be 
determined by routine procedures, such as RT-PCR, and a hybrid 
human/animal or porcine/animal factor- VIII constructed by 
site-directed mutagenesis. Also, hybrid human/animal or 
porcine/non-porcine mammalian factor VIII having reduced 
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cross -reactivity with human plasma, .compared to porcine factor 
VIII can be . prepared that .has corresponding, amino acid 
sequence substitution from one or more other animals. In a 
further embodiment, cross-reactivity can be reduced by 
substitution of amino acid sequence having no known identity 
to factor VIII amino acid; sequence, preferably alanine 
residues using . alanine scanning mutagenesis techniques, for 
porcine epitope sequence . 

After identification of clinically significant epitopes, 
recombinant hybrid, factor VIII molecules will be expressed 
that have less than or equal cross-reactivity compared with 
porcine factor . VIII when tested, in .vitro against a broad 

survey of inhibitor plasmas. Preferably these molecules will 
be ; combined A2/C2 hybrids in which immunoreactive amino acid 
sequence in these domains is replaced by other mammalian 
sequence. Additional mutagenesis in these regions may be, done 
to reduce cross-reactivity. Reduced cross-reactivity, 

although desirable, is not necessary to produce a product that 
may have advantages over the existing porcine factor VIII 
concentrate, which produces side effects due to contaminant 
porcine proteins and may produce untoward effects, due to the 
immunogenicity ' of porcine factor VIII sequences. A hybrid 
human/other mammalian or porcine/other mammalian factor VIII 
molecule will not - contain foreign porcine * proteins . 
Additionally, the extensive epitope mapping accomplished in 
the porcine A2 domain indicates that greater than 95% of the 
therapeutic hybrid human/porcine factor VIII sequence will be 
human . 

Preparation of hybrid factor VIII equivalents: 
* The- methods for amino' acid substitution in factor VIII 
molecules described above and in the examples can also be used 
to prepare procoagulant recombinant hybrid factor VTII 
equivalent molecules or fragments thereof comprising at least 
one : amino acid sequence including one or. more amino acids 
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having no. known amino acid sequence identity to factor VIII 
("non-factor VIII sequence") substituted for at least one 
specific amino acid sequence that includes an antigenic and/or 
immunogenic site in human, animal, or hybrid factor VIII. The 
resulting active hybrid factor VIII equivalent molecule has 
equal or less reactivity with factor VIII inhibitory 
antibodies and/or less immunogenicity in human and animals 
than the unsubst ituted human, animal, or hybrid factor VIII. 

Suitable amino acid residues that can be substituted for 
those sequences of amino acids critical to coagulant and/or 
antigenic and/or immunogenic activity in human or animal 
factor VIII or hybrid human/animal factor VIII to prepare a 
hybrid equivalent factor VIII molecule include any amino acids 
having no known sequence identity to animal or human factor 
VIII amino acid sequence that has coagulant, antigenic, or 
immunogenic activity. In a preferred embodiment, the amino 
acids that can be substituted include alanine residues using 
alanine scanning mutagenesis techniques. 

Hybrid factor VIII equivalent molecules described herein 
also include those molecules in which amino acid residues 
having no known identity to animal factor VIII sequence are 
substituted for amino acid residues not critical to coagulant, 
antigenic, or immunogenic activity. 

As described above, in one embodiment of a hybrid factor 
VIII equivalent molecule, the molecule has reduced cross - 
reactivity with inhibitor plasmas. One or more epitopes in 
the cross-reactive factor VIII are identified, as described 
above, and then replaced by non- factor VIII amino acid 
sequence, preferably alanine residues, using, for example, the 
alanine scanning mutagenesis method. 

In a preferred embodiment, a procoagulant recombinant 
hybrid factor VIII equivalent molecule is prepared comprising 
at least one sequence including one or more amino acids having 
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no known sequence identity to f actor *. VIII , preferably alanine 
residues, substituted for at least one sequence including one 
or more amino acids including an epitope, and/or for at least 
one sequence including one or more amino acids including an 
immunogenic site, preferably in human factor VIII. The 
resulting hybrid equivalent factor VIII molecule or fragment 
thereof has reduced or no immunoreactivity with inhibitory 
antibodies to factor VIII and/or reduced or no immunogenicity 
in human cr animals. The methods for identifying specific 
antigenic amino acid sequence in the A2 domain of human factor 
VIII for substitution by nonantigenic porcine unique amino 
acid sequence are described in Examples 7 and 8 and are 
exemplary for identifying antigenic sequence in the A2 and 
other domains of human and animal factor VIII and for using 
site -directed mutagenesis methods such as alanine scanning 
mutagenesis to substitute non-factor VIII amino acid sequence . 

Since the human A2 epitope has been narrowed to 25 or few 
amino acids, as described in Example 8, alanine scanning 
mutagenesis can be performed on a limited number of hybrid 
factor VIII constructs having human amino acid sequence to 
determine which are procoagulant , non- immunoreact ive and/or 
nonimmunogenic hybrid factor VIII constructs based on A2 amino 
acid substitutions. In the A2 domain, the most likely 
candidates for alanine substitutions to achieve both reduced 
antigenicity . and immunogenicity in the -hybrid construct are 
Arg4 84> Pro4 85, Tyr4 87, Ser488, Arg4 89, Pro4 92, Val4 95, 
Phe501, and Ile508. The binding affinity of a hybrid 
construct comprising each of these mutants for mAb413 and a 
panel of A2 specific patient IgGs will be determined by ELISA. 
Any mutant that is active and has a - binding affinity for A2 
inhibitors that is reduced by more than 2 orders of magnitude 
is a candidate for the A2 substituted factor VIII molecule. 
Constructs having more than one mutation -will be' selected, 
based on the assumption that the more the epitope is altered, 
the less * immunogenic it will be. It is possible that there 
are other candidate residues in the region between Arg4 84- 
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Ile508, since there may be key residues for the epitope that 
are common to both human and porcine factor VIII. For 
example, charged residues are frequently involved in protein- 
protein interactions and, in fact, an alanine substitute for 
Arg490 produces a factor VIII procoagulated having only 0.2% 
of the reactivity to inhibitor of human factor VIII (Table 
VI) . Similarly, an alanine substitution for Lys493 is a 
possible candidate. 

This procedure will be carried out in the C2 epitope and 
the putative third epitope, which is thought to be in the A3 
or CI domains, as well as any other epitopes identified in 
factor VIII, to prepare hybrid equivalent factor VIII 
constructs . 

Diagnostic Assays . 

The hybrid human/animal, animal/animal, or equivalent 
factor VIII cDNA and/or protein expressed therefrom, in whole 
or in part, can be used in assays as diagnostic reagents for 
the detection of inhibitory antibodies to human or animal 
-factor VIII or to hybrid human/animal factor or equivalent 
, VIII in substrates, including, for example, samples of serum 
and body fluids of human patients with factor VIII deficiency. 
These antibody assays include assays such as ELISA assays, 
immunoblots, radioimmunoassays, immunodiffusion assays, and 
assay of factor VIII biological activity (e.g. , by coagulation 
assay) . Techniques for preparing these reagents and methods 
for use thereof are known to those skilled in the art. For 
example, an immunoassay for detection of inhibitory antibodies 
in a patient serum sample can include reacting the test sample 
with a sufficient amount of the hybrid human/animal factor 
.VIII that contains at least one antigenic site, wherein the 
amount is sufficient to form a detectable complex with the 
inhibitory antibodies* in the sample. 

Nucleic acid and amino acid probes can be prepared based 
on the sequence of the hybrid human/porcine, human /non- human, 
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non-porcine mammalian, animal/animal, or equivalent factor 
VIII cDNA or protein molecule or fragments thereof . In some 
embodiments, these can be ■ labeled using dyes or enzymatic, 
fluorescent, chemiluminescent, or radioactive labels that are 
commercially available. The amino acid probes can be used, 
for example, to screen sera or other body fluids where the 
presence of inhibitors to human, animal, or hybrid 
human/animal factor VIII is suspected. Levels of inhibitors 
can be quantitated in patients and compared to healthy 
controls, and can be used, for example, to determine whether 
a patient with a factor VIII deficiency can be treated with a 
hybrid human/animal or hybrid equivalent factor VIII. The 
cDNA probes can be used, for example, for research purposes in 
screening DNA libraries . 

Pharmaceutical Compositions . ; 

Pharmaceutical compositions containing hybrid 

human/ animal , por c ine/non - human , non - por c ine mamma 1 i an , 
animal - l/animal-2 , or equivalent factor VIII, alone or in 
combination with appropriate pharmaceutical stabilization 
compounds, delivery vehicles ,.. and/or carrier vehicles, are 
prepared according to known methods, as described in 
Remington's Pharmaceutical Sciences by E.W. Martin. 

In one- preferred embodiment, the preferred carriers or 
delivery vehicles for intravenous infusion are physiological 
saline or phosphate buffered saline. 

In another preferred embodiment, suitable stabilization 
compounds, delivery vehicles, and carrier vehicles include but 
are not limited to other human or animal proteins such as 
albumin . ■ - - 

Phospholipid vesicles or liposomal suspensions are also 
preferred as pharmaceutically acceptable carriers or delivery 
vehicles. These can be prepared according to methods known to 
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those skilled in the art and can contain, for example, 
phosphatidylserine/ -phosphatidylcholine or other compositions 
of phospholipids or detergents that together impart a negative 
charge to the surface, since factor VIII binds to negatively 
charged phospholipid membranes. Liposomes may be prepared by 
dissolving appropriate lipid (s) (such as stearoyl phosphatidyl 
ethanolamine , stearoyl phosphatidyl choline, arachadoyl 
phosphatidyl choline, and cholesterol) in an inorganic solvent 
that is then evaporated, leaving behind a thin film of dried 
lipid on the surface of the container. An aqueous solution of 
the hybrid factor VIII is then introduced into the container. 
The container is then swirled by hand to free lipid material 
from the sides of the container and to disperse lipid 
aggregates, thereby forming the liposomal suspension. 

The hybrid factor or hybrid equivalent factor VIII can be 
combined with other suitable stabilization compounds, delivery 
vehicles, and/or carrier vehicles, including vitamin K 
dependent clotting factors, tissue factor, and von Willebrand 
factor (vWf) or a fragment of vWf that contains the factor 
VIII binding site, and polysaccharides such as sucrose. ^ 

Hybrid or hybrid equivalent factor VIII can also be 
delivered by gene therapy in the same way that human factor 
VIII can be delivered, using delivery means such as retroviral 
vectors. This method consists of incorporation of factor VIII 
cDNA into human cells that are transplanted directly into a 
factor VIII deficient patient or that are placed in an 
implantable device, permeable to the factor VIII molecules but 
impermeable to cells, that is then transplanted. The 
preferred method will be retroviral-mediated gene transfer. 
In this method, an exogenous gene (e.g., a factor VIII cDNA) 
is cloned into the genome of a modified retrovirus. The gene 
is inserted into the genome of the host cell by viral 
machinery - where it will be expressed by the cell. The 
retroviral vector is modified so that it . will, not produce 
virus, preventing viral infection of the host. The general 
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principles for this type of therapy are known to those skilled 
in the art and have been reviewed; in the literature [e.g., 
Kohn, D.B. et al . (1989) Transufusion 129:812-820] . 



5 Hybrid factor VIII can be stored bound to vWf to increase 

the half -life and shelf -life of the hybrid molecule. 
Additionally, lyophilization of factor VIII can improve the 
yields of active molecules in the presence of vWf . Current 
methods for storage of human and animal factor VIII used by 

10 commercial suppliers can be employed for storage of hybrid 

factor VIII. These methods include: (1) lyophilization of 
factor VIII in a partially-purified state (as a factor VIII 
"concentrate" that is infused without further purification) ; 
(2) immunoaf f inity-purif ication of factor VIII by the 

15 Zimmerman method and lyophilization in the presence of 

albumin, which stabilizes the factor VIII; (3) lyophilization 
of recombinant factor VIII in the presence of albumin. 

Additionally, hybrid factor VIII has been indefinitely 
2 0 stable at 4° C in 0.6. M NaCl, 2 0 mM MES, and 5 mM CaCl 2 at pH 

6.0 and also can be stored frozen in these buffers and thawed 
with minimal loss of activity. 

Methods of Treatment. 

2 5 Hybrid or hybrid equivalent factor VIII is used to treat 

r uncontrolled bleeding due to factor VIII deficiency (e.g. , 
** : intraarticular, intracranial, or gastrointestinal hemorrhage) 

in hemophiliacs with and without inhibitory antibodies and in 
patients with acquired factor VIII deficiency due to the 
30 development of inhibitory antibodies. The active materials 

are preferably administered intravenously. 

Additionally, hybrid or hybrid equivalent factor VIII can 
be administered by transplant of cells genetically- engineered 

3 5 to produce the hybrid or by implantation of a device 

containing such cells, as described above. 
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In a preferred embodiment, pharmaceutical compositions of 
hybrid or hybrid equivalent factor VIII alone or in 
combination with stabilizers, delivery vehicles, and/or 
carriers are infused into patients intravenously according to 
5 the same procedure that is used for infusion of human or 

animal factor VIII. 

The treatment dosages of hybrid or hybrid equivalent 
factor VIII composition that must be administered to a patient 

10 in need of such treatment will vary depending on the severity 

of the factor VIII deficiency. Generally, dosage level is 
adjusted in frequency, duration, and units in keeping with the 
severity and duration of each patient's bleeding episode. 
Accordingly, the hybrid factor VIII is included in the 

15 pharmaceutical^ acceptable carrier, delivery vehicle, or 

stabilizer in an amount sufficient to deliver to a patient a 
therapeutically effective amount of the hybrid to stop 
bleeding, as measured by standard clotting assays. 

Factor VIII is classically defined as that substance 
present in normal blood plasma that corrects the clotting 
defect in plasma derived from individuals with hemophilia A. 
The coagulant activity in vitro of purified and partially- 
purified forms of factor VIII is used to calculate the dose, of 
factor VIII for infusions in human patients and is a reliable 
indicator of activity recovered from patient plasma and of 
correction of the in vivo bleeding defect. There are no 
reported discrepancies between standard assay of novel factor 
VIII molecules in vitro and their behavior in the dog infusion 
model or in human patients, according to Lusher, J.M. et al . 
328 New Engl. J. Med. 328.: 453-459 ; Pittman, D'.D. et v al . (1992) 
Blood 79:389-397; and Brinkhous et al . (1985) Proc . Natl. 
Acad. Sci. 82 r8752-8755. 
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Usually, the desired plasma factor VIII level to be 
achieved in the patient through administration of the hybrid 
or hybrid equivalent factor VIII is in the range of 30-100% of 
normal. In a preferred mode of administration of the hybrid 
or hybrid equivalent factor VIII, the composition is given 
intravenously at a preferred dosage in the range from about 5 
to 50 units/kg body weight, more preferably in a range of 10- 
50 units/kg body weight, and most preferably at a dosage of 
20-40 units/kg body weight; the interval frequency is in the 
range from about 8 to 24 hours (in severely affected 
hemophiliacs) ; and the- duration of treatment in days is in the 
range from 1 to 10 days or until the bleeding episode is 
resolved. See, e.g., Roberts, H.R., and MR. Jones, 

"Hemophilia and Related Conditions - Congenital Deficiencies 
of Prothrombin (Factor II, Factor V, and Factors VII to XII) , » 
Ch. 153, 1453-1474, 1460, in Hematology . Williams/ W. J.", et 
al . , ed. (1990). Patients with inhibitors may require more 
hybrid or hybrid equivalent factor VIII, or patients may 
require less hybrid or hybrid equivalent factor VIII because 
of its higher specific activity than human factor VIII or 
decreased antibody reactivity or immunogenicity . As in 
treatment with human or porcine factor VIII, the amount of 
hybrid or hybrid equivalent factor VIII infused is defined by 
the one-stage factor VIII coagulation assay and, in selected 
instances, in vivo recovery is determined by measuring the 
factor VIII in : the patient 1 s plasma after infusion. It is to 
be understood that for any particular subject, specific dosage 
regimens should be adjusted over time according to the 
individual need and the professional judgment of the person 
administering or supervising the administration of the 
compositions, and that the concentration ranges set forth 
herein are exemplary only and are not intended to limit the 
scope or practice of the claimed composition. 

Treatment- can take the form of a single intravenous 
administration of the composition or periodic or continuous 
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administration over an extended period of time, as required. 
Alternatively, hybrid or hybrid equivalent factor VIII can be 
administered subcutaneous ly or orally with liposomes in one or 
several doses at varying intervals of time. 

5 

Hybrid or hybrid equivalent factor VIII can also be used 
to treat uncontrolled bleeding due to factor VIII deficiency 
in hemophiliacs who have developed antibodies to human factor 
VIII. In this case, coagulant activity that is superior to 
10 that of human or animal factor VIII alone is not necessary. 

Coagulant activity that is inferior to that of human factor 
VIII (i.e., less than 3,000 units/mg) will be useful if that 
activity is not neutralized by antibodies in the patient's 
plasma. 

15 . 

The hybrid or hybrid equivalent factor VIII molecule and 
the methods for isolation, characterization, making, and using 
it generally described above will be further understood with 
reference to the following non-limiting examples. 

20- . 

Example 1: Assay of porcine factor VIII and hybrid 

human/porcine factor VIII. 

Porcine factor VIII has more coagulant activity than 
25 human factor VIII, based on specific activity of the molecule. 

These 1 results are shown in Table III in Example. 4. This 
conclusion is based on the use of appropriate standard curves 
that allow human porcine factor VIII to be fairly compared. 
Coagulation assays are based on the ability of factor VIII to 
30 shorten the clotting time of plasma derived from a patient 

with hemophilia A. Two types of assays were employed: the 
one -stage and the two stage assay. 

In the one-stage assay, 0.1 ml hemophilia A plasma 
35 (George King Biomedical, Inc.) was incubated with 0.1 ml 

activated partial thromboplastin reagent: (APTT) (Organon 
Teknika) and 0.01 ml sample- or standard, consisting of 
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diluted, citrated normal human plasma, for 5 minat 37°C in a 
water bath.— Incubation was followed by addition of 0.1 ml 20 
mM CaCl 2 , and the time for development of a fibrin clot was 
determined by visual inspection. 

5 

A unit of factor VIII is defined as the amount present in 
1 ml of citrated normal human plasma. With human plasma as 
the standard, porcine and human factor VIII activity were 
compared directly. Dilutions of the plasma standard or 
10 purified proteins were made into 0 . 15 M NaCl, 0 . 02 M HEPES, pH 

7.4. The standard curve was constructed based on 3 or 4 
dilutions of plasma, the highest dilution being 1/50, and on 
logio clotting time plotted against log 10 plasma concentration, 
which results in a linear plot. The units of factor VIII in 
15 an unknown sample were determined by interpolation from the 

standard curve. 

The one -stage assay relies on endogenous activation of 
factor VIII by activators formed in the hemophilia A plasma, 
whereas the two-stage assay measures the procoagulant activity 
of preactivated factor VIII. In the two-stage assay, samples 
containing factor VIII that had been reacted with thrombin 
were added to a mixture of activated partial thromboplastin 
and human . hemophilia A. plasma that had been preincubated for 
5- min at 37°C. The resulting clotting times, were then 
converted to units/ml, based on the same human standard curve 
described above. The relative activity in the two-stage assay 
was higher than in the one-stage assay because the factor VIII 
had been preactivated. 

Example 2: Characterization of the functional 

difference between human and porcine 
factor VIII. 

The isolation of porcine and human plasma-derived factor 
VIII and human recombinant factor VIII have been described in 
the literature in Fulcher, C.A. et al . (1982) Proc . Natl. 
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Acad. Sci. USA 79:1648-1652; Toole et al . (1984) Nature 
312 : 342-347 (Genetics Institute); Gitschier et al . (1984) 
Nature 312 : 326-330 (Genentech) ; Wood et al . (1984) Nature 
312 :330-337 (Genentech); Vehar et al . 312 Nature 312:337-342 

(Genentech); Fass et al . (1982) Blood .59:594; Toole et al . 

(1966) Proc. Natl. Acad. Sci. USA 83. : 593 9 - 5942 . This can be 
accomplished in several ways. All these preparations are 
similar in subunit composition, although there is a functional 
di f ference in stability between human and porcine factor VIII. 

For comparison of human recombinant and porcine factor 
VIII, preparations of highly-purified human recombinant factor 
VIII (Cutter Laboratories, Berkeley, CA) and porcine factor 
VIII [ immunopurif ied as described in Fass et al . (1982) Blood 
5_9:5 94] were subjected to high-pressure liquid chromatography 
(HPLC) over a Mono Q™ ( Pharmacia -LKB , Pi scat away, NJ) anion - 
exchange column (Pharmacia, Inc.). The purposes of the Mono 
Q™ HPLC step were elimination of minor impurities of exchange 
of human and porcine factor VIII into a common buffer for 
comparative purposes. Vials containing 1000-2000 units^of 
factor VIII were reconstituted with 5 ml H 2 0. Hepes (2-.M at 

pH 7.4) was then added to a final concentration of 0.02 M. 
Factor VIII was applied to a Mono Q™ HR .5/5 column 
equilibrated in 0.15 M NaCl, 0.02 M Hepes, 5mM CaCl 2 , at pH 
7.4 (Buffer A plus 0 . 15 M NaCl) ; washed with 10 ml Buffer A + 
0.15 M NaCl; and eluted with a 20 ml linear gradient, 0 . 15 M 
to 0.90 M NaCl in Buffer A at a flow rate of 1 ml/min. 

For comparison of human plasma-derived factor VIII 
(purified by Mono Q™ HPLC) and porcine factor VIII, 
immunoaf f inity-purif ied, plasma-derived porcine factor VIII 
was diluted 1:4 with 0 . 04 M Hepes, 5 mM CaCl 2/ 0.01% Tween-80, 
at pH 7.4, and subjected to Mono Q™ HPLC under the same 
conditions described in the previous paragraph for human 
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factor VIII. These procedures for. the. isolation of human and 
porcine factor VIII are standard for those skilled in the art. 



10 



15 



Column fractions were assayed for factor VIII activity by 
a one-stage coagulation assay. The average results of the 
assays, expressed in units of activity per A 2eo of material, 
are given in Table II, and indicate that porcine factor VIII 
has at least six times greater activity than human factor VIII 
when the one-stage assay is used. 

TABLE II: COMPARISON OF HUMAN AND PORCINE FACTOR VIII 
COAGULANT ACTIVITY 



Porcine 

Human plasma-derived 
Human recombinant 



Activity (U/A 280 ) 

21,300 
3,600 
2,400 



20 



25 



30 



35 



Exampl e 3 : 



Comparison of the stability of human and 
porcine factor VIII. 



The * results of the one-stage assay for. factor VIII 
reflect activation of factor VIII to factor Villa in the 
sample and possibly loss of formed factor Villa activity. A 
direct comparison of the stability of human and porcine factor 
VIII was made. Samples from Mono Q™ HPLC (Pharmacia, Inc., 
Piscataway, N.J.) were diluted to the same concentration and 
buffer composition and reacted with thrombin. At various 
times, samples were removed for two- stage coagulation assay. 
Typically, peak activity (at 2 min) was 10 -fold greater for 
porcine than human factor Villa, and the activities of both 
porcine and human factor Villa subsequently decreased, with 
human factor Villa 

activity decreasing more rapidly. 



40 



Generally, attempts to isolate stable human factor Villa 
are not successful even when conditions that, produce stable 



58 



BNSDOCIO: <WO_ 



9946274A1 ) > 



WO 99/46274 



PCT/US99/05193 



porcine factor Villa - are used. To demonstrate this, Mono Q™ 
HPLC-purif ied human factor VIII was activated with thrombin 
and subjected to Mono S™ cation-exchange (Pharmacia, Inc.) 
HPLC under conditions that produce stable porcine factor 
Villa, as described by Lollar et al . (1989) Biochemistry 

Human factor VIII, 4 3 /zg/ml (0.2 /xM) in 0 . 2 M NaCl, 0.01 
M Hepes, 2.5 mM CaCl 2/ at pH 7.4, in 10 ml total volume, was 
reacted with thrombin (0.036 jxM) for 10 min, at which time 
FPR-CH 2 C1 D -phenyl -prolyl -arginyl -chloromethyl ketone was 
added to a concentration of 0.2 fj.M for irreversible 
inactivation of thrombin. The mixture then was diluted 1:1 
with 40 mM 2- (N-morpholino) ethane sulfonic acid (MES) , 5 mM 
CaCl 2/ at pH 6.0, and loaded at 2 ml/min onto a Mono S™ HR 5/5 
HPLC column (Pharmacia, Inc.) equilibrated in 5 mM MES, 5 mM 
CaCl 2/ at pH 6.0 (Buffer B) plus 0.1 M NaCl . Factor Villa was 
eluted without column washing with a 2 0 ml gradient from 0.1 
M NaCl to 0.9 M NaCl in Buffer B at 1 ml/min. 

The fraction with coagulant activity in the two-stage 
assay eluted as a single peak under these conditions. .The 
specific activity of the peak fraction was approximately 7, .500 
U/A 2 q 0 . Sodium dodecyl sulf ate-polyacrylamide :gel 

electrophoresis (SDS-PAGE) of the Mono S™ factor Villa peak, 
followed by silver staining of the protein, revealed two bands 
corresponding to a heterodimeric (A3-C1-C2/A1) derivative of 
-factor VIII. Although the A2 fragment was not identified by 
silver staining under these conditions because of its low 
concentration, it was identified as a trace constituent by 
125 I-labeling. 

In contrast to the results with human factor VIII, 
porcine factor Villa isolated by Mono S w HPLC under the same 
conditions had a specific activity 1 . 6 x 10 6 U/A 280 . Analysis 
of porcine factor Villa by SDS-PAGE revealed 3 fragments 
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corresponding to Al, A2 , and A3- CI- C2 subunits, demonstrating 
that porcine factor Villa possesses .three subunits. 

The results of Mono S™ HPLC of human thrombin-activated 
factor VIII preparations at pH 6.0 indicate that human factor 
Villa is labile under conditions that yield stable porcine 
faccor Villa. However, although trace amounts of A2 fragment 
were identified in the peak .fraction, determination of whether 
the coagulant activity resulted from small amounts of 
heterotrimeric factor Villa or from heterodimeric factor Villa 
that has. a low specific activity was not possible from this 
method alone. 

A way to isolate human factor Villa before it loses its 
A2 subunit is desirable to resolve this question. To this 
end, isolation was accomplished in a procedure that involves 
reduction of the pH of the Mono S™ buffers to pH 5. Mono Q™- 
purified human factor VIII (0.5 mg) was diluted with H 2 0 to 
give a final composition of 0.25 mg/ml (1 fim) factor VIII in 
0.25 M NaCl, 0.01 M Hepes, 2.5 mM CaCl 2 , 0.005% Tween-80, at 
pH 7.4 (total volume 7.0 ml). Thrombin was added to a final, 
concentration of 0.072 /zm and allowed to . react f or . 3 min. 
Thrombin was then inactivated with FPR-CH 2 C1 (0.2 pm) . The 
mixture then was diluted 1:1 with 40 mM sodium acetate, 5 mM 
CaCl 2 , 0.01% Tween-80, at pH 5.0, and loaded at 2 ml/min onto 
a Mono S™ HR 5/5 HPLC column equilibrated in.. 0.01 -M sodium 
acetate, • 5 mM CaCl 2 , 0.01% Tween^80, at pH 5.0, plus 0.1 M 
NaCl . Factor Villa was eluted without column washing with a 
2 0 ml gradient from 0.1 M NaCl to 1.0 M NaCl in the same 
buffer at 1 ml/min. This resulted in recovery of coagulant 
activity in a peak that contained detectable amounts of the A2 
fragment as shown by SDS-PAGE and silver staining. The 
specific activity of the peak, fraction was tenfold greater 
than that recovered at pH 6 . 0 (75,000 U/A 280 v. .7,500 U/A 280 ) • 
However, in contrast to . porcine factor Villa isolated at pH 
6.0, which-is indefinitely stable at 4°C,. human factor Villa 
activity decreased • steadily * over -a period of several hours 
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after elution from Mono S™ . Additionally, the specific 
activity of factor Villa purified at pH 5.0 and assayed 
immediately is only 5% that of porcine factor Villa, 
indicating that substantial dissociation occurred prior to 
5 assay. 

These results demonstrate that both human and porcine 
factor Villa are composed of three subunits (Al, A2 , and A3- 
C1-C2) . Dissociation of the A2 subunit is responsible for the 
10 loss of activity of both human and porcine factor Villa under 

certain conditions, such as physiological ionic strength, pH, 
and concentration. The relative stability of porcine factor 
Villa under certain conditions is because of stronger 
association of the A2 subunit. 

15 

Example 4: Preparation of hybrid human/porcine 

factor VIII by. reconstitution with 
subunits . 

20 Porcine factor VIII light chains and factor VIII heavy 

chains were isolated as follows. A 0.5 M solution of EDTA at 
~: pH 7.4 was added to Mono Q™-purified porcine factor VIII to a 
::: final concentration of 0.05 M and was allowed to stand at room 
temperature for 18-24 h. An equal volume of 10 mM histidine- 

25- CI, 10 mM EDTA, 0.2% v/v Tween 80, at pH 6.0 (Buffer B) , : was 

. • added, and the solution was applied at 1 ml/min to a Mono S™ 

HR 5/5 column previously equilibrated in Buffer A plus 0.25 M 
NaCl . Factor VIII heavy chains did not bind the resin, as 
judged by SDS-PAGE. Factor VIII light chain was eluted with 

30 a linear, 20 ml, 0.1-0.7 M NaCl gradient in Buffer A at 1 

ml/min and was homogeneous by SDS-PAGE. Factor VIII heavy 
chains were isolated by mono Q™ HPLC (Pharmacia, Inc., 
Piscataway, N.J.) in the following way. Factor VIII heavy 
chains do not adsorb to mono S™ during the purification of 

35 factor VIII light chains. The fall-through material that 

contained factor VIII heavy chains was adjusted to pH 7.2 by 
addition of 0 . 5 M Hepes buffer, pH 7 . 4 , and applied to a mono 
Q™ HR5/5 HPLC column (Pharmacia, Inc.) equilibrated in 0.1 M 
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NaCl, 0.02 M Hepes, 0.01% Tween-80, pH 7.4:. . The column was 
washed with 10 ml -of this buffer, and factor VIII heavy chains 
were eluted with a 20 ml 0.1-1.0 M NaCl gradient in this 
..buffer. Human light chains and heavy chains were isolated in 
the same manner . 

Human and porcine light and heavy, chains were 
reconstituted according to the following steps. Ten f±l human 
or porcine factor VIII light chain, 10 0 jig/ml , was mixed in 1 
M NaCl, 0.02 M Hepes, 5 mM CaCl 2 , 0.01% Tween-80, pH 7.4, with 
(D..25 fil heterologous heavy chain, 60 /xg/ml , in the same 
buffer; (2) 10 ^1 0.02 M Hepes, 0.01% Tween-80, pH 7.4; (3) 5 
fil 0.6 M CaCl 2 , for 14 hr at room temperature. The mixture 
was diluted 1/4 with 0.02 M MES, 0.01% Tween-80, 5 mM CaCl 2 , 
pH 6 and applied to Mono S™ Hr5/5 equilibrated in 0.1 M NaCl, 
0.02 M MES, 0.01% Tween-80, 5mM Cacl 2 , pH 6.0. A 20 ml 
gradient was run from 0.1 - 1.0 M l NaCl in the same buffer at 
1 ml/min, and 0 . 5 ml fractions were collected. Absorbance was 
read at 280 nm of fractions, and fractions were assayed with 
absorbance for factor VIII activity by the one-stage clotting 
assay. Heavy chains were ; present in excess, because free 
light chain (not associated with heavy chain) also binds Mono 
S™; excess heavy chains ensure that free light chains are not 
part, of the preparation.. Reconstitution experiments followed 
by Mono S™ .HPLC purification were performed with all four 
possible combinations of chains: human light chain/human 
heavy chain, human light chain/porcine heavy chain, porcine 
light chain/porcine heavy chain, porocine light chain/human 
heavy chain. Table III shows that human light chain/porcine 
heavy chain factor VIII has activity comparable to native 
porcine factor VIII (Table II), indicating that structural 
elements in the porcine heavy chain are responsible for the 
increased coagulant activity of porcine factor VIII compared 
to human factor VIII. 
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TABLE III: COMPARISON OF HYBRID HUMAN/PORCINE 

FACTOR VIII COAGULANT ACTIVITY WITH 
HUMAN AND PORCINE FACTOR VIII 

5 Activity (U/A 280 ) 

Porcine light chain/porcine heavy chain 3 0,600 
Human light chain/porcine heavy chain 44,100 
Porcine light chain/human heavy chain 1,100 
10 Human light chain/human heavy chain 1,000 



Example 5: Preparation of active hybrid 

human/porcine factor VIII by 

15 reconstitution with domains. 

The porcine A1/A3-C1-C2 dimer, the porcine A2 domain, the 
human A1/A3-C1-C2 dimer, and the human A2 domain were each 
isolated from porcine or human blood, according to the method 

20" described in Lollar et al . (1992) J. Biol. Ctiem. 

267(33) : 23652-23657 . For example, to isolate the porcine 
A1/A3-C1-C2 dimer, porcine factor Villa (140 /ig) at pH 6.0 was 
raised to pH 8 . 0 by addition of 5 N NaOH for 30 minutes, 
producing dissociation of the A2 domain and 95 percent 

25 '] inactivation by clotting assay. The mixture was diluted 1:8 

with buffer B (20 mM HEPES, 5 mM CaCl 2 , 0.01% Tween-80, : pH 
7.4) and applied to a monoS column equilibrated in buffer B. 
The A1/A3-C1-C2 dimer eluted as a single sharp peak at 
approximately 0\.4 M NaCl by using* a 0.1-1.0 M NaCl gradient in 

3 0 buffer B. To isolate the porcine A2 domain, porcine factor 

Villa was made according to the method of Lollar et al . (1989) 
Biochem 28:666-674, starting with 0.64 mg of factor VIII. 
Free porcine A2 domain was isolated as a minor component (50 
fig) at 0.3 M NaCl in the MonoS™ chromatogram. 



35 



Hybrid human/porcine factor VIII molecules were 
reconstituted from the dimers and domains as follows. The 
concentrations and buffer conditions for the purified 
components were as follows: porcine A2 , 0.63 /xM in buffer A 
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(5 mM MES; 5 mM CaCl 2 , 0.01% Tween 80, pH 6/0) plus 0.3 M 
NaCl; porcine A1/A3-C1-C2 . 0.27 fiM in buffer B plus 0.4 M 
NaCl # pH 7.4; human A2 , I /zM in 0.3 M NaCl, 10 mM histidine- 
HC1, 5 mM CaCl 2/ 0.01 % Tween 20, pH 6.0; human A1/A3 -C1-C2 . 
0.18 /iM in 0.5 M NaCl, 10 mM hist idine-Cl , 2 . 5 mM CaCl 2/ 0.1% 
Tween-20, pH 6.0. Reconstitution experiments were done by 
mixing equal volumes of A2 domain and A1/A3-C1-C2 dimer. In 
mixing experiments with porcine A1/A3-C1-C2 dimer, the pH was 
lowered to 6.0 by addition of 0 . 5 M MES, pH 6.0, to 7 0 mM. 

The coagulation activities of all four possible hybrid 
factor Villa molecules - [pA2/ (hAl/A3 -C1-C2 ) ] , [hA2/ (pAl/A3 - 
C1-C2)] , [pA2/ (pAl/pA3-Cl-C2) ] , and [hA2/ <hAl/A3-Cl-C2) ] - 
were obtained by a two-stage clotting assay at various times. 

The generation of activity following mixing the A2 
domains and A1/A3-C1-C2 dimers was nearly complete by one hour 
and was stable for at least 24 hours at 37 °C. Table IV shows 
the activity of reconstituted hybrid factor Villa molecules 
when assayed at 1 hour. The two- stage assay, by which the 
specific activities of factor Villa molecules were obtained, 
differs from the one-stage assay, and the values cannot be 
compared to activity values of factor VIII molecules obtained 
by a one- stage assay. . 

TABLE IV: COMPARISON OF COAGULANT ACTIVITIES OF DOMAIN- 
SUBSTITUTED HYBRID HUMAN/ PORCINE FACTOR Villa 



Hybrid fVIIIa 



Specific 
Activity (U/mg) 



Porcine A2 + Human 
A1/A3-C1-C2 



140, 000 



Porcine A2 + Porcine 
A1/A3-C1-C2 



70, 000 



Human A2 + Porcine 
A1/A3-C1-C2 



40, 000 



Human A2 + Human 



40, 000 



A1/A3-C1-C2 
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Table IV shows that the greatest activity was exhibited 
by the porcine A2 domain/human A1/A3-C1-C2 dimer, followed by 
the porcine A2 domain/porcine A1/A3-C1-C2 dimer. 

Thus, when the A2 domain of porcine factor Villa was 
mixed with the A1/A3-C1-C2 dimer of human factor Villa, 
coagulant activity was obtained. Further, when the A2 domain 
of human factor Villa was mixed with the A1/A3-C1-C2 dimer of 
porcine factor Villa, coagulant activity was obtained. By 
themselves, the A2 , Al , and A3-C1-C2 regions have no coagulant 
activity. 

Example 6; Isolation and sequencing of the A2 domain 

of porcine factor VIII. 

Only the nucleotide sequence encoding the B domain and 
part of the A2 domain of porcine, factor VIII has been 
sequenced previously [Toole et al . (1986) Proc . Natl. Acad. 
Sci. USA 83.: 5939-5942] . The cDNA and predicted amino acid 
sequences (SEQ ID NOs : 3 and 4, respectively) for the entire 
porcine factor VIII A2 domain are disclosed herein. 

The porcine factor VIII A2 domain was cloned by reverse 
transcription of porcine spleen total RNA and PCR 
amplification; degenerate primers based on the known human 
factor VIII cDNA sequence and an exact porcine primer based on 
a part of the porcine factor VIII sequence were used. A 1 kb 
PCR product was isolated and amplified by insertion into a 
Bluescript™ (Stratagene) phagemid vector. 

The porcine A2 domain was completely sequenced by dideoxy 
sequencing. The cDNA and predicted amino acid sequences are 
as described in SEQ ID NOs : 3 and 4, respectively. 



65 



99^2 74A1 I > 



WO 99/46274 



, PCT/US99/05193 



Example 7: Preparation of recombinant hybrid 

human/animal factor VIII 

The nucleotide and predicted amino acid sequences (SEQ ID 
NOs: 1. and 2, respectively) of human factor VIII have been 
described in the literature [Toole et al . (1984) Nature 
312:342-347 (Genetics Institute); Gitschier et al . Nature 
312 : 326-330 (Genentech); Wood, et al . (1984) Nature 312 : 330- 
337 (Genentech) ; Vehar et al . Nature 312 :337-342 (Genentech)] . 

Making recombinant hybrid human/animal factor VIII 
requires that a region of human factor VIII cDNA (Biogen 
Corp.) be removed and the animal cDNA sequence having sequence 
identity be inserted. Subsequently, the hybrid cDNA is 
expressed in an appropriate expression system. As an example, 
hybrid factor VIII cDNAs were cloned in which some or all of 
the porcine A2 domain was substituted for the corresponding 
human A2 sequences. Initially, the entire cDNA sequence 
corresponding to the A2 domain of human factor VIII and then 
a smaller part of the A2 domain was looped out by 
oligonucleotide-mediated. mutagenesis, a -method commonly known 
to those skilled in the art (see, e.g., Sambrook, J., E.F. 
Fritsch, and T. Maniatis, Molecular Cloning : A Laboratory 
Manual . Chapter 15, Cold Spring Harbor Press, Cold Spring 
Harbor, 1989). The steps were as follows. 

Materials . 

Methoxycarbonyl-D-cyclohexylglycyl -glycl -arginine-p- 
nitroanilide (Spectrozyme™ Xa) and anti-factor VIII monoclonal 
antibodies ESH4 and ESH8 were purchased from American 
Diagnostica ^ (Greenwich, CT) . Unilamellar 

phosphatidylcholine/phosphatidylserine (75/25, w/w) vesicles 
were prepared according to the method of Barenholtz, Y., et 
al., 16 Biochemistry 2806-2810 (1977)). Recombinant 
desulf atohirudin' was obtained from Dr. R. B. Wallis, Ciba- 
Geigy Pharmaceuticals (Cerritos, CA) . Porcine factors IXa, X, 
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Xa, and thrombin were isolated according to the methods of 
Lollar et ai . (1984) Blood 63.: 1303-1306 , and Duffy, E.J. et 
al. (1992) J\ Biol. Chew. 207 : 7621-7827 . Albumin-free pure 

recombinant human factor VIII was obtained from Baxter-Biotech 
(Deerf ield, ID . 

Cloning of the porcine factor VIII A2 domain. 

The cDNA encoding the porcine A2 domain was obtained 
following PCR of reverse- transcribed porcine spleen mRNA 
isolated as described by Chomczyneki et al . (1987) Anal. 
Biochew. 162 : 156-159 . cDNA was prepared using the first-strand 
cDNA synthesis kit with random hexamers as primers (Pharmacia, 
Piscataway, N.J.). PCR was carried out using a 5' -terminal 
degenerate primer 5' - AARCAYCCNAARACNTGGG 3» (SEQ ID NO: 11), 
based on known limited porcine A2 amino acid sequence, and a 
3' -terminal exact primer, 5' GCTCGCACTAGGGGGTCTTGAATTC 3' J SEQ 
ID NO: 12) , based on known porcine DNA sequence immediately 3 f 
of the porcine A2 domain. These oligonucleotides correspond 
to nucleotides 1186-1203 and 2289-2313 in the human sequence 

(SEQ ID NO:l) . Amplification was carried out for 35 cycles (1 

si . 

minute 94°C, 2 minutes 50°C, 2 minutes 72°C) using Taq DNA 
polymerase (Promega . Corp. , Madison, WI) . The 1.1-kilobase 
amplified fragment was cloned into pBluescript II « KS- 
(Stratagene) at the EcoKV site using the T-vector procedure, 
as described by Murchuk, D. et al . (1991) Nucl . Acids Res. 
1^:1154. Escherichia coli XL1 -Blue -competent cells were 
transformed, and plasmid DNA was isolated. Sequencing was 
carried out in both directions using Sequenase™ version 2.0 
(U.S. Biochemical Corp., a Division of Amersham LifeScience, 
Inc., Arlington Hts, IL) . This sequence was confirmed by an 
identical sequence that was obtained by direct sequencing of 
the PCR product from an independent reverse transcription of 
spleen RNA from the same pig (Circumvent™, New England 
Biolabs, Beverly, MA) . The region containing the epitope for 
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10 



autoantibody RG was identified as 373-536 in human factor VIII 
(SEQ ID NO:2) V ^ 

Construction a nd expression of a hybrid human/porn'np 
factor VIII cDNA. ~ 

B-domainless human factor VIII (HB~, from Biogen, Inc. 
Cambridge, MA) , which lacks sequences encoding for amino acid 
. .residues 741-1648 (SEQ ID N0:2), was used as the starting 
material for construction of a hybrid human/porcine factor 
VIII. HB~ was cloned into the expression vector ReNeo . To 
facilitate manipulation, the cDNA for factor VIII was 
isolated as a Xhol/Hpal fragment from ReNeo and cloned 
into Xhol/EcoRV digested pBlueScript II KS . An 

15 oligonucleotide, 5' CCTTCCTTTATCCAAATACGTAGATCAAGAGGAAATTGAC 

3 1 - (SEQ ID NO:7), was used in a site-directed mutagenesis 
reaction using uracil -containing phage DNA, as described by 
Kunkel, T.A. et al. (1991) Meth. Enzymol 204:125-139, to 
simultaneously loop-out the human A2 sequence (nucleotides 
20 1169-2304 in SEQ ID NO:l) and introduce a SnaBI restriction 

site. The A2 -domainless human factor VIII containing plasmid 
was digested with SnaBI followed by addition of Clal linkers. 
The porcine A2 domain was then amplified by PCR using the 
phosphorylated 5 1 primer 5 ! GTAGCGTTGCCAAGAAGCACCCTAAGACG 3 1 
25 (SEQ ID NO : 8 ) and 3' primer 5* 

J GAAGAGTAGTACGAGTTATTTCTCTGGGTTCAATGAC 3' (SEQ ID NO: 9) , 

respectively. Clal linkers were added to the PCR product 
followed by ligation into the human factor Vlli-containing 
vector. The A1/A2 and A2/A3 junctions were corrected to 
3 0 restore the precise thrombin cleavage and flanking sequences 

by site-directed mutagenesis using the oligonucleotide 
shown in SEQ ID NO : 8 and nucleotides 1-22 (5' GAA . . 
. TTC in SEQ ID NO: 9) to correct the 5'- and 3 ' - terminal 
junctions, respectively. " In the resulting construct, 
35 designated HP1, the human A2 domain was exactly substituted 

with the porcine A2 domain. A preliminary product contained 
an unwanted thymine at the A1-A2 junction as a result of the 
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PCR amplification of the porcine A2 domain. This single base 
was looped out by use of the mutagenic oligonucleotide 5 f 
CCTTTATCCAAATACGTAGCGTTTGCCAAGAAG 3' (SEQ ID NO: 10). The 
resulting hybrid nucleotide sequence encoded active- factor 
VIII having human Al , porcine A2 and human A3, CI and C2 
domains . 

A region containing 63% of the porcine NH 2 -terminal A2 
domain, which encompasses the putative A2 epitope, was 
substituted for the homologous human sequence of B-domainless 
cDNA by exchanging Spel/BaniKI fragments between the 
pBluescript plasmids containing human factor VIII and 
human/porcine A2 factor VIII cDNA. The sequence was confirmed 
by sequencing the A2 domain and splice sites. Finally, a 
Spel/Apal fragment, containing the entire A2 sequence, was 
substituted in place of the corresponding sequence in HB", 
producing the HP2 construct. 

Preliminary expression of HB" and HP2 in COS-7 cells was 
tested after DEAE-dextran-mediated DNA transf ection, as 
described by Seldon, R.F., in Current Protoc ols in Molecular 
Biolocrv (Ausubel, F.M., et al . , eds) , pp. 9.21-9.26, Wiley 
Interscience, N.Y. After active factor VIII expression was 
confirmed and preliminary antibody inhibition studies were 
done, HB" and HP2 DNA were then stably transf ected into baby 
hamster kidney cells using liposome-mediated transf ection 
(Lipof ectin® Life Technologies, Inc., Gaithersburg, MD) . 
Plasmid-containing clones were selected for G418 resistance in 
Dulbecco's modified Eagle's medium-F12, 10% fetal calf serum 
. (DMEM-F12/10% fetal calf serum) containing 400 /xg/ml G418, 
followed by maintenance in DMEM-F12/10% fetal calf serum 
containing 100 /xg/ml G418. Colonies showing maximum 

expression of HB" and HP2 factor VIII activity were selected 
by .ring cloning and expanded for further characterization. 
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HB" and HP2 factor VIII expression • was compared by 
plasma-free, factor. VIII, assay, one-stage clotting, assay, and 
enzyme-linked immunosorbent assay using purified recombinant 
human factor VIII as a standard. Specific coagulant 

activities cf 260 0 and 2580 units/mg were obtained for HB" and 
HP2, respectively. HB" and HP2 produced 1.2 and 1.4 
units/ml/48 hours/10 7 cells, respectively. This is identical 
to that of the wild type construct (2,600 ± 200 units/mg). 
The specific activities of HB" and HP2 were indistinguishable 
in the plasma- free factor VIII assay. 

The biological activity of recombinant hybrid 
human/animal and equivalent factor VIII with Al, A2 , A3, CI, 
and/or C2 domain substitutions can be evaluated initially by 
use of a COS-cell mammalian transient expression system. 
Hybrid human/animal and equivalent cDNA can be transfected 
into COS cells, and supernatants can be analyzed for factor 
VIII activity by use of one-stage and two-stage coagulation 
assays as described above. Additionally, factor VIII activity 
can be measured by use of a chromogenic substrate assay, which 
is more sensitive and allows analysis of larger numbers of 
samples. Similar assays are standard in the assay of factor 
VIII activity [Wood et al . (1984) Mature 312:330-337; Toole et 
al. (1984) Nature 312:342-347] . Expression of recombinant 
factor VIII in COS cells is also a standard procedure [Toole 
et al. (1984) Nature 312:342-347; Pittman et al. (1988) Proc . 
Natl. Acad. Sci . USA 85.: 2429-2433] . 

The human factor VIII cDNA used as starting materials for 
the recombinant molecules described herein has been expressed 
in COS cells yielding a product with biological activity. 
.This material, as described above, can be used as a standard 
to compare hybrid human/animal factor VIII molecules. The 
activity in the assays, is. converted to a specific activity for 
proper comparison of the hybrid molecules. For this, a 
measurement of the mass of factor VIII produced by the cells 
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is necessary and can be done by immunoassay with purified 
human and/or animal factor VIII as standards. Immunoassays 
for factor VIII are routine for those skilled in the art [See, 
e.g., Lollar et al . (1988) Blood 71:137-143] . 

Exsirr.pl e 8. Determination of inhibitory activity in 

hybrid human /animal and equivalent factor 
VIII. 

Sequences of human and animal factor VIII likely to be 
involved as epitopes (i.e., as recognition sites for 
inhibitory antibodies that react with factor VIII) can be 
determined using routine procedures, for example through use 
of assay with antibodies to factor VIII combined with site 
directed mutagenesis techniques such as splicing by overlap 
extension methods (SOE) , as shown below. Sequences of animal 
factor VIII that are not antigenic compared to corresponding 
antigenic human sequences can be identified, and substitutions 
can be made to insert animal sequences and delete human 
sequences according to standard recombinant DNA methods. 
Sequences of amino acids such as alanine residues having no 
known sequence identity to factor VIII can also be substituted 
by standard recombinant DNA methods or by alanine scanning 
mutagenesis. Porcine factor VIII reacts less than human 
factor VIII with some inhibitory antibodies; this provides a 
basis for current therapy for patients with inhibitors. After 
the recombinant hybrids are made, they can be tested in vitro 
for reactivity with routine assays, including the Bethesda 
inhibitor assay. Those constructs that are less reactive than 
native human factor VIII and native animal factor VIII are 
candidates for replacement therapy. 

The epitopes to which most, if not all, inhibitory 
antibodies reactive with human factor VIII are directed are 
thought to reside in two regions in the 2332 amino acid human 
factor VIII molecule, the A2 domain (amino acid residues 373- 
740) and the C2 domain (amino acid residues 2173-2332, both 
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sequences shown . in SEQ ID NO: 2) -. The A2 epitope has been 
eliminated by. making a recombinant hybrid human-porcine 
factor VIII molecule in which part of the human A2 domain is 
replaced by the porcine sequence having sequence identity to 
the replaced human amino acid sequence. This was 

accomplished, as described in example 7, by cloning the 
porcine A2 domain by standard molecular biology techniques and 
then cutting and splicing within the A2 domain using 
restriction sites. In the resulting construct, designated 
HP2, residues 373-604 (SEQ ID NO: 4) of porcine factor VIII 
were substituted into the human A2 domain. HP2 was assayed 
for immunoreactivity with anti-human factor VIII antibodies 
using the following methods . 

Factor VIII enzyme -linked immunosorbent assay. 

Microtiter plate wells were coated with 0.15 ml of 6 
/xg/ml ESH4,. a human factor VIII light-chain antibody, and 
incubated overnight. After the plate was washed three times 
with H 2 0, the wells were blocked for 1 hour with 0.15 M NaCl, 
10 mM sodium phosphate, 0.05% Tween 20, 0.05% nonfat dry milk, 
0.05% sodium azide, pH 7.4. To increase sensitivity, samples 
containing factor VIII were activated with 3 0 nM thrombin for 
15 minutes. Recombinant desulf atohirudin then was added at 
100 nM to inhibit thrombin.. The plate was washed again- and 
0.1 ml of sample or pure recombinant human factor VIII (10-600 
ng/ml) , used as the standard, were added. Following a 2 hour 
incubation, the plate was washed and 0.1 ml of biotinylated 
ESH8 , another factor VIII light-chain antibody, was added to 
each well. ESH8 was biotinylated using the Pierce 

sulf osuccinimidyl-6- (biotinamide) hexanoate biotinylation kit. 
After a 1 hour incubation, the plate was washed and 0.1 ml of 
strepavidin alkaline phosphatase was added to each well. The 
piate was developed, using ' the Bio-Rad alkaline phosphatase 
substrate reagent kit, and the resulting absorbance at 4 05 nm 
for each well was determined by using a Vmax microtiter plate 
reader (Molecular Devices, Inc., Sunnyville, CA) . Unknown 
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factor VIII concentrations were determined from the linear 
portion of the factor VIII standard curve. 

Factor VIII assays. 

HB~ and HP2 factor VIII were measured in a one -stage 
clotting assay, which was performed as described above [Bowie, 
E.J.W., and C.A. Owen, in Disorders of Hemostasis (Ratnoff and 
Forbes, eds) pp. 43-72, Grunn & Stratton, Inc., Orlando, FL 
(1984)], or by a plasma-free assay as follows. HB" or HP2 
factor VIII was activated by 4 0 nM thrombin in 0,15 M NaCl , 2 0 
nM HEPES, 5 mM CaCl 2 / 0.01% Tween 80, pH 7.4, in the presence 
of 10 nM factor IXa, 425 nM factor X, and 50 [iM unilamellar 
phosphatidylserine/phosphatidylcholine (25/75, w/w) vesicles. 

After 5 minutes, the reaction was stopped with 0.05 M EDTA and 
10 0 nM recombinant desulf atohirudin, and the resultant factor 
Xa was measured by chromogenic substrate assay, according to 
the method of Hill-Eubanks et al (1990) J. Biol. Chem. 
265 : 17854-17858 . Under these conditions, the amount of factor 
Xa formed was linearly proportional to the starting factor 
VIII concentration as judged by using purified recombinant 
human factor VIII (Baxter Biotech, Deerfield, IL) as- the 
standard. 

Prior to clotting assay, HB" or HP2 factor VIII were 
concentrated from 4 8 hour conditioned medium to 10-15 units/ml 
by heparin-Sepharose™ chromatography. HB" or HP2 factor VIII 
were added to hemophilia A plasma (George King Biomedical) to 
a final concentration of 1 unit/ml. Inhibitor titers in RC or 
MR plasma or a stock solution of mAb 413 IgG (4 fxM) were 
measured by the Bethesda assay as described by Kasper, C.K. et 
al. (1975) " Thromb. Diath. Haemorrh 34 : 869-872 . Inhibitor IgG 
was prepared as described by Leyte, A. et al . (1991) J. Biol. 
Chem. 266 i74r0-746 - 
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HP2 does not react with anti-A2 . antibodies . Therefore, 
residues 373-603 must contain an epitope for anti-A2 
antibodies. 

Preparation of hybrid human-porcine factor VIII and assay 
by splicing by overlap extension (SQE) . 

Several more procoagulant recombinant hybrid 
human/porcine factor VIII B-domainless molecules with porcine 
amino acid substitutions in the human A2 region have been 
prepared to further narrow the A2 epitope. Besides 
restriction site techniques-, the "splicing by overlap 
extension" method (SOE) as described by Ho et al . (1989) Gene 

77:51-59, has been used to substitute any arbitrary region of 
porcine factor VIII cDNA. In SOE, the splice site is defined 
by overlapping oligonucleotides that can be amplified to 
produce the desired cDNA by PCR. Ten cDNA constructs, 
designated HP4 through HP13 , have been made. They were 
inserted into the ReNeo expression vector, stably transfected 
into baby hamster kidney cells, and expressed to high levels 
[0.5-1 fig (approximately 3-6 units) /10 7 cells/24 hours] as 
described in Example 7. Factor VIII coagulant activity was 
determined in the presence and absence of a model murine 
monoclonal inhibitory antibody specific for the A2 domain, 
mAb413 . In the absence of inhibitor, all of the constructs 
had a specific coagulant activity that was indistinguishable 
from B(-) human factor VIII. 

The hybrid human/porcine factor VIII constructs were 
assayed for reactivity with the anti-A2 inhibitor mAb413 using 
the Bethesda assay [Kasper et al . (1975) Thromb. Dia.th. 

Haemorrh. 34.: 869-872] . The Bethesda unit (BU) is the standard 

method for measuring inhibitor titers. The results are shown 
in Table V, and are compared to recombinant human factor VIII. 
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TABLE V: COMPARISON OF IMMUNOREACTIVITY OF AMINO ACID- 
SUBSTITUTED HYBRID HUMAN/PORCINE FACTOR VIII 



Construct 



Porcine 
Substitution 



Inhibition 
mAb413 (BU/mg IgG) 



Human B ( - ) f VI 1 1 


None 


1470 


HP4 


373-540 


<0,7 


HPS 


373-508 


<0 . 7 


HP6 


373-444 


1450 


HP7 


445-508 


<0 . 7 


HPS 


373-483 


1250 


HPS 


484-508 


<0 . 7 


HP10 


373-403 


1170 


HP11 


404-508 


<0 . 7 


HP12 


489-508 


<0 . 7 


HP13 


484-488 


<0 . 7 



The -boundaries of porcine substitutions are defined by 
the first amino acids that differ between human and porcine 
factor VIII at the NH 2 -terminal and C-terminal ends of the 
insertion. As shown in Table V, if the Bethesda titer is not 
measurable (<0.7 BU/mg IgG) , then an A2 epitope lies in the 
region of substituted porcine sequence. The epitope has been 
progressively narrowed to residues 484-509 (SEQ ID NO;2), 
consisting of only 25 residues, as exemplified by non- 
reactivity of mAb413 with HP9 . Among constructs HP4 through 
HP11, HP 9 was the most "humanized" construct that did not 
react with the inhibitor. This indicates that a critical 
region in the A2 epitope is located within the sequence 
Arg484-Ile508 . 



Based on a comparison between human and porcine factor 
VIII of the amino acid sequence in this critical region, two 

35 more constructs, HP12 and HP13 , were made, in which 

corresponding porcine amino acid sequence was substituted for 
human amino acids 489-508 and 484-488, respectively. Neither 
reacts with mAb413. This indicates that -residues on each side 
of the Arg4 88-Ser4 8 9 bond are important for reaction with A2 

40 inhibitors. In HP12 only 5 residues are non-human, and in 

HP13 only 4 residues are non-human. The 484-508, 484-488, and 
489-508 porcine substituted hybrids displayed decreased 

75 



BNSDOC1D: <WO 994627 AA\ J _> 



WO 99/46274 



PCT/US99/05193 



inhibition by A2 inhibitors from, four patient plasmas, 
suggesting that there is little variation in the structure of 
the A2 epitope according to. the inhibitor population response. 

The reactivity of the most humanized constructs/ HP9, 
HP12, and HP13 , with two anti-A2 IgG5 preparations prepared 
from inhibitor plasmas was determined. Like mAb413 / these 
antibodies did . not react with HP9, HP12, and HP13 , but did 
react with the control constructs HP(-) and HP8 . 

The region between 484-508 can be. further analyzed for 
final identification of the critical A2 epitope, using the 
same procedures . 

The methods described in Examples 7 and 8 can be used to 
prepare other hybrid human/non-porcine mammalian factor VIII 
with amino acid substitution in the human A2 or other domains, 
hybrid human/animal or animal/animal factor VIII with amino 
acid substitution in any domain, or hybrid factor VII 
equivalent molecules or fragments of any of these, such hybrid 
factor VIII having reduced or absent immunoreactivity with 
anti-factor VIII antibodies. 

Example 9. Elimination of human factor VIII A2 

inhibitor reactivity by site- 
directed mutagenesis 

Example 8 showed that substitution of the porcine 
sequence bounded by residues 484 and 50 8 into the human factor 
VIII A2 domain yields a molecule that has markedly decreased 
reactivity with a panel of A2-specific factor VIII inhibitors 
[see also . Healey et al . (1995) J. Biol. Chem. 270:14505- 
14509] . In this region, there are 9 amino acid differences 
between human and porcine factor VIII. These nine residues in 
human B-domainless factor VIII, R484, P485, Y487, P488, R489, 
P492, V495, F501, and 1508 (using the single letter amino 
code) , were individually changed to alanine by site-directed 
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mutagenesis. Additionally, Mlul and Sac2 restriction sites 

were placed in the factor VIII cDNA at sites 5 ! and 3' 
relative to the A2 epitope, without changing the amino acids 
corresponding to these sites, to facilitate cloning. The nine 
mutants were stably transfected into baby hamster kidney cells 
and expressed to high levels. All nine produced biologically 
active factor VIII. They were partially purified and 
concentrated by heparin- Sepharose chromatography as described 
by Healey et al . 

The mutants have been characterized by their reactivity 
with the murine monoclonal inhibitor MAb413 as in Example 7. 
This inhibitor recognizes the same or a very closely clustered 
epitope in the A2 domain as all human inhibitors studied to 
date. Inhibitor reactivity was measured using the Bethesda 
assay. Briefly, the Bethesda titer of an inhibitor is the 
dilution of inhibitor that inhibits factor VIII by 50%- in a 
standard one-stage factor VIII clotting assay. For example, 
if solution of antibody is diluted 1/420 and it inhibits the 
recombinant factor VIII test sample by 50%, the Bethesda titer 
is 420 U. In the case of a pure monoclonal like MAb413:, the 
mass of antibody is known, so the results are expressed in 
Bethesda units (BU) per mg MAb413 . To find the 50% inhibition 
point, a range of dilutions of MAb413 was made and 50% 
inhibition was found by a curve fitting procedure. The 
results are as follows: 
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Table VI 



Mutation : MAb413 titer (BU/mg) % 

Reactivity* 

Wild-type, B (-) fVIII 9400 

484 - A 160 1.7 

P485 - A 4000 42 

Y487 - A 50 0.53 

P488 - A 3500 37 

R489 - A 1.6 0 . 015 

R490 - A < — > <0 .2> 

P492 - A 630 6.7 

V495 - A 10700 113 

F501 - A 11900 126 

1508 - A 5620 60 



* Relative to wild- type 

These results indicate that it is possible to reduce the 
antigenicity of factor VIII toward the model A2 inhibitor by 
over a factor of 10 by making alanine substitutions at 
positions 484, 487, 489, and 492. The reactivity of R489 - A 
is reduced by nearly 4 orders of magnitude. Any of these 
alanine substitutions can be therapeutically useful to reduce 
the antigenicity and the immunogenic ity of factor VIII. 

The results confirm the efficacy of alanine -scanning 
mutagenesis and further demonstrate that biological activity 
is retained even though the amino acid sequence has been 
altered within an epitope reactive to an inhibitory antibody. 
Five of the nine sites where the human and porcine sequences 
differ are also sites where the human and murine sequences 
differ. ; The factor VIIIs having alanine substitutions at 
these positions are therefore examples of a hybrid factor VIII 
equivalent molecule having a sequence with no known sequence 
identify with any presently known mammalian factor VIII. 

Further modification, e.g*. by combining two alanine 
substitutions, can also provide greatly reduced antigenicity 
for a wider range of patients, since polyclonal variant 
antibodies differing from patient to patient can react with 
variants of the factor VIII A2 epitope. In addition, 
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immunogenicity (the capacity to induce antibodies) is further 
reduced by incorporation of more than one amino acid 
substitution. Such substitutions can include both alanine, 
porcine -specific amino acids, or other amino acids known to 
have low immunogenic potential. The substitutions at 
positions 490, 495 and 501 are likely to be useful in reducing 
immunogenicity. In addition, these substitutions are likely 
to reduce reactivity to certain patient antibodies. 

Other effective, antigenicity-reducing amino acid 
substitutions, besides alanine, can be made as long as care is 
taken to avoid those previously noted as being major 
contributors to antigen -antibody binding energy, or having 
bulky or charged side chains. Amino acids whose substitutions 
within an epitope reduce the antigenic reactivity thereof are 
termed " immunoreact ivi ty- reduc ing " amino acids herein. 
Besides alanine, other immunoreactivity-reducing amino acids 
include, without limitation, methionine, leucine, serine and 
glycine. It will be understood that the reduction of 
immunoreact ivity achievable by a given amino acid will also 
depend on any effects the substitution may have on protein 
conformation, epitope accessibility and the like. 

Example 10. 

Klenow fragment, phosphorylated Clal linkers, Not I 
linkers, T4 ligase, and Tag DNA polymerase were purchased from 
Promega (Madison, Wisconsin) . Polynucleotide kinase was 
purchased from Life Technologies, Inc., Gaithersburg, 
Maryland. y 32 P-ATP. (Redivue, >5000Ci/mmol) was purchased from 
Amersham. pBluescript II KS- and E. coli Epicurean XLl-Blue 
cells were purchased from Stratagene (La Jolla, California) . 
Synthetic oligonucleotides were purchased from Life 
Technologies, Inc. or Cruachem, Inc. 5 ' -phosphnrylated. primers 
were used when PCR products were produced for cloning 
purposes. Nucleotide (nt) numbering of oligonucleotides used 
as primers for polymerase chain reaction (PCR) amplification 
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of porcine fVIII cDNA or genomic DNA uses the human fVIII cDNA 
as reference (Wood et al . (1984) supra). 

Porcine spleen total RNA was isolated by acid ' guanidinium 
thiocyanate-phenol-chloroform extraction [Chomczynski et al . 
(1987) Anal. Biochem. 1£2:156-159] . Porcine cDNA was prepared 
from total spleen RNA using Moloney murine leukemia virus 
reverse transcriptase (RT) and random hexamers to prime the 
reaction (First-Strand cDNA Synthesis Kit, Pharmacia Biotech) 
unless otherwise indicated. RT reactions contained 45 mM 
Tris-Cl, pH 8.3, 68 mM KCl, 15 mM DTT, 9 mM MgCl 2/ 0.08 mg/ml 
bovine serum albumin and 1 . 8 mM deoxynucleotide triphosphate 
(dNTP) . Porcine genomic DNA was isolated from spleen using a 
standard procedure (Strauss, W.M. (1995) In Current Protocols 
in Molecular Biology , F. M. Ausubel et al . , editors, John 
Wiley & Sons, pp. 2.2.1-2.2.3). Isolation of DNA from agarose 
gels was done using Geneclean II (Bio 101) or Quiex II Gel 
Extraction Kit (Qiagen) . 

PCR reactions were done using a Hybaid. OmniGene 
thermocycler . For PCR reactions employing Taq DNA polymerase, 
reactions included 0.6 mM MgCl 2 , 0.2 mM dNTPs , 0.5 /zM 
oligonucleotide primers, 50 U/ml polymerase and 0.1 volume of 
first strand cDNA reaction mix.. Except where indicated 
otherwise, PCR products ' were gel purified, blunt-ended with 
Klenow fragment, precipitated with ethanol, and either ligated 
to the EcoRV site of dephosphorylated pBluescript II KS- or 
ligated with phosphorylated Clal linkers using T4 ligase, 
digested with Clal, purified by Sephacryl S400 chromatography, 
and ligated to Clal-cut, dephosphorylated pBluescript II KS - . 
Ligations were done using T4 DNA ligase (Rapid DNA ligation 
kit, Boehringer Mannheim) except where indicated. otherwise. 
Insert -containing pBluescript II KS- plasmids were used to 
transform E. coll Epicurean XLl-Blue cells. 
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Sequencing of plasmid DNA was done using an Applied 
Biosystems 373a automated DNA sequencer and the PRISM dye 
terminator kit or manually using Sequenase v. 2.0 sequencing 
kit (Amersham Corporation) . Direct sequencing of PCR products, 
.including 32 P-end labelling of oligonucleotides was done using 
a cycle sequencing protocol (dsDNA Cycle Sequencing System, 
Life Technologies) . 

Isolation of porcine fVIII cDNA clones containing 5 T UTR 
sequence, signal peptide and Al domain codons . 

The .porcine fVIII cDNA 5 f to the A2 domain was amplified 
by nested RT-PCR of female pig spleen total RNA using a 5 1 
rapid amplification of cDNA ends (5' -RACE) protocol (Marathon 
cDNA Amplification, Clontech, Version PR554 53) . This included 
first strand cDNA synthesis using a lock-docking oligo (dT) 
primer [Borson, N.D. et al . (1992) PCR Methods Appl . 
2: 144 -148] , second strand cDNA synthesis using E. coli DNA 
polymerase I, and ligation with a 5 1 extended double stranded 
adaptor, SEQ ID NO: 13 

5' -CTA ATA CGA CTC ACT ATA GGG CTC GAG CGG CCG CCC GGG CAG GT- 
3 

3 1 -H 2 N-CCCGTCCA->P0 4 -5 1 
whose short strand was blocked at the 3' end with an amino 
group to reduce non-specific PCR priming and which was 
complementary to the 8 nucleotides at the 3 1 end (Siebert, 
P.D., et al. (1995) Nucleic. Acids. Res. 23: 1087-1088 ) . The 
first round of PCR was. done using an adaptor- specif ic 
oligonucleotide, SEQ ID NO: 14 5' -CCA TCC TAA TAC GAC TCA CTA 
TAG GGC-3 1 (designated API) as sense primer, and a porcine 
fVIII A2 domain specific oligonucleotide SEQ ID NO:15 B'-CCA 
TTG ACA TGA AGA CCG TTT CTC-3 ■ (nt 2081-2104) as antisense 
primer. The second round of PCR was done using a nested, 
adaptor-specific oligonucleotide, SEQ ID NO:16 5 1 -ACT CAC TAT 
AGG GCT CGA GCG GC-3 1 (designated AP2) as sense primer, and a 
nested, porcine A2 domain-specific oligonucleotide SEQ ID 
N0;17 5* -GGG TGC AAA GCG CTG ACA TCA GTG-3 1 (nt 1497-1520) as 
antisense primer. PCR was carried out using a commercial kit 
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(Advantage cDNA PCR core kit) which employs- an antibody- 
mediated hot start protocol [Kellogg, D.E. et al . (1994) 
BioTechniques 16:1134-113-7]. PCR conditions included 
denaturation at 94 °C for 60 sec, followed by 3 0 cycles (first 
PCR) or 25 cycles (second PCR) of denaturation for 30 sec at 
94 °C, annealing for 30 sec at 60°C and elongation for 4 min at 
68°C using tube temperature control. This procedure yielded a 
prominent =1.6 kb product which was consistent with 
amplification of a fragment extending approximately 150 bp 
into the 5' UTR. The PCR product was cloned into pBluescript 
using Clal linkers. The inserts of four clones were sequenced 
in both directions . 

The sequence of these clones included regions 
corresponding to 13 7 bp of the 5' UTR, the signal peptide, the 
Al domain and part of the A2 domain. A consensus was reached 
in at least 3 of 4 sites. However, the clones contained an 
average of 4 apparent PCR-generated mutations, presumably due 
to the multiple rounds of PCR required to generate a clonable 
product. Therefore, we used sequence obtained from the signal 
peptide region to design a sense strand phosphorylated PCR 
primer, SEQ ID NO: 18 5 1 -CCT CTC GAG CCA CCA TGT CGA GCC ACC 
ATG CAG CTA GAG CTC TCC ACC TG-3 1 , designated RENEOPIGSP, for 
synthesis of another PCR product to confirm the sequence and 
for cloning into an expression vector. The sequence in bold 
represents the start codon. The sequence 5' to this represents 
sequence identical to that 5 ! of the insertion site into the 
mammalian expression viector ReNeo used for expression of fVIII 
(Lubin et al . (1994) supra) . • This site includes an Xhol 

cleavage site (underlined) . RENEOPIGSP. and the nt 1497-1520 
oligonucleotide were used to prime a Taq DNA polymerase- 
mediated PCR reaction using porcine female spleen cDNA as a 
template. DNA polymerases from several other manufacturers 
failed to yield a detectable product. PCR conditions included 
denaturation at 94 °C for four min, followed by 35 cycles of 
denaturation for 1 min at 94°C, annealing for 2 min at 55°C 

82 



9946274A1 J_> 



WO 99/46274 



PCT/US99/05193 



and. elongation for 2 min at 72°C, followed by a final 
elongation step for 5 min at 72°C.The PCR product was cloned 
into pBluescript using Clal linkers. The inserts of two of 
these clones were sequenced in both directions and matched the 
consensus sequence . 

Isolation of porcine fVIII cDNA clones containing A3. CI 
and 5' half of the C2 domain codons . 

Initially, two porcine spleen RT-PCR products, 
corresponding to a B-A3 domain fragment (nt 4519-5571) and a 
C1-C2 domain fragment (nt 6405-6990) were cloned. The 3' end 
of the C2 domain that was obtained extended into the exon 2 6 
region, which is the terminal exon in fVIII. The B-A3 product 
was made using the porcine- specif ic B domain primer, SEQ ID 
NO: 19 5' CGC GCG GCC GCG CAT CTG G CA AAG CTG AGT T 3', where 
the underlined region corresponds to a region in porcine fVIII 
that aligns with nt 4519-4530 in human fVIII. The .5 1 region of 
the oligonucleotide includes a NotI site that was originally 
intended for cloning purposes. The antisense primer used in 
generating the B-A3 product, SEQ ID NO: 20 5 1 -GAA ATA AGC CCA 
GGC TTT GCA GTC RAA-3 1 was based on the reverse complement of 
the human fVIII cDNA sequence at nt 5545-5571. The PCR 
reaction contained 50 mM KCl, 10 mM Tris-Cl, pH 9.0, 0.1% 
Triton X-100, 1.5 mM MgCl 2 , 2.5 mM dNTPs, 20 /uM primers, 25 
units/ml Taq DNA polymerase and 1/2 0 volume of RT reaction 
mix. PCR conditions were denaturation at 94°C for 3 min, 
followed by 3 0 cycles of denaturation for 1 min at 94° C, 
annealing for 2 min at 50 °C and elongation for 2 min at 72°C. 
The PCR products were phosphorylated using T4 DNA kinase and 
NotI linkers were added. After cutting with NotI, the PCR 
fragments were cloned into the NotI site of BlueScript II KS- 
and transformed into XLl-Blue cells. 

The C1-C2 product was made using the known human cDNA 
sequence to synthesize sense and antisense primers, SEQ ID 
NO:21 5 1 -AGG AAA TTC CAC TGG AAC CTT N-3 ' (nt 6405-6426) and 
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SEQ ID NO: 22 5»-CTG GGG GTG AAT TCG AAG GTA GCG N-3 ' (reverse 
complement of nt 6966-6990) , respectively. PCR conditions were 
identical to those used to generate the B-A2 product. The 
resulting fragment was . ligated to the pNOT. cloning vector 
using the Prime PCR Cloner Cloning System (5 Prime-3 Prime, 
Inc., Boulder, Colorado) and grown in JM109 cells. 

The B-A3 and C1-C2 plasmids were partially sequenced to 
make the porcine-specific sense and antisense 
oligonucleotides, SEQ ID NO:23 5 1 -GAG TTC ATC GGG AAG ACC TGT 
TG-3 1 (nt 4551-4573) and SEQ ID NO:24 5 ' -ACA GCC CAT CAA CTC 
CAT GCG AAG- 3 1 (nt 6541-6564), respectively. These 
oligonucleotides were used as primers to generate a 2013 bp 
RT-PCR product using a Clontech Advantage cDNA PCR kit. This 
product, which corresponds to human nt 4551-6564, includes the 
region corresponding to the light chain activation peptide, (nt 
5002-5124), A3 domain (nt 5125-6114) and most of the CI domain 
(nt 6115-6573) . The sequence of the C1-C2 clone had 
established that human and porcine cDNAs from nt 6565 to the 
3' end of the CI domain were identical. The PCR product cloned 
into the EcoRV site of pBluescript II KS- . Four clones were 
completely sequenced in both directions. A consensus was 
reached in at least 3 of 4 sites. 

Isolation of porcine fVIII cDNA clones containing the 3 1 
half of the C2 domain codons . 

The C2 domain of human fVIII (nucleotides 6574-7053) is 
contained within exons 24-26 [Gitschier J. et al . (1984) 
Nature 312 : 326-3301 . Human exon 26 contains 1958 bp, 

corresponding nucleotides 6901-8858. It includes 1478 bp of 3' 
untranslated sequence. Attempts to clone the exon 26 cDNA 
corresponding to the 3 ' end of the C2 domain and the 3 ' UTR by 
3 1 RACE [Siebert et al . (1995) supra], inverse PCR [Ochman, H. 

et al. (1990) Biotechnology (N. Y) . jB:759-760], restriction 

site PCR [Sarkar, G. et al . (1993) PCR Meth, Appl . 2:318-322], 

"unpredictably primed" PCR [Dominguez, 0. et al. (1994) 
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Nucleic. Acids Res. 22: 3247-3248] and by screening a porcine 
liver cDNA library failed. 3' RACE was attempted using the 
same adaptor- ligated double stranded cDNA library that was 
used to successfully used to clone the 5' end of the porcine 
fVIII cDNA. Thus, the failure of this method was not due to 
the absence of cDNA corresponding to exon 26. 

A targeted gene walking PCR procedure [Parker, J.D. et 
al . (1991) Nucleic. Acids. Res. JL9: 3055-3060] was used to 
clone the 3' half of the C2 domain. A porcine-specific sense 
primer, SEQ ID NO:25 5 ' -TCAGGGCAATCAGGACTCC-3 1 (nt 6904-6924) 
was synthesized based on the initial C2 domain sequence and 
was used in a PCR reaction with nonspecific "walking" primers 
selected from oligonucleotides available in the laboratory. 
The PCR products were then targeted by primer extension 
analysis [Parker et al . (1991) BioTechniques 10:94-101] using 
a 32 P-end labelled porcine-specific internal primer, SEQ ID 
NO : 2 6 5 ' - CCGTGGTGAACGCTCTGGACC - 3 1 (nt 6932-6952). 

Interestingly, of the 40 nonspecific primers tested, only two 
yielded positive products on primer extension analysis and 
.these two corresponded to an exact and a degenerate human 
sequence at the 3' end of the C2 domain: SEQ ID NO: 27 5'- 
GTAGAGGTCCTGTGCCTCGCAGCC-3 1 (nt 7030-7053) and SEQ ID~NO:28 
5 1 - GTAGAGSTSCTGKGCCTCRCAKCCYAG - 3 (nt 7027-7053). These 

primers had initially been designed to yield a product by 
conventional RT-PCR but failed to yield sufficient product 
that could be visualized by ethidium bromide dye binding. 
However, a PCR product could be identified by the more 
sensitive primer extension method. This product was gel- 
purified and directly sequenced. This extended the sequence of 
porcine fVIII 3» to nt 7026. 

Additional sequence was obtained by primer extension 
analysis of a nested PCR product generated using the adaptor- 
ligated double -stranded cDNA library used in- the 5 ' -RACE 
protocol described previously. The first round reaction used 
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the porcine exact primer SEQ ID NO: 29 5 ! - 
CTTCGCATGGAGTTGATGGGCTGT.-3 ' (nt 6541-6564)" and the API primer. 
The • second round reaction used SEQ ID NO:30 5 f - 
AATGAGGACTCCTCCACCCCCG-3 ' (nt 6913-6934) and the AP2 primer. 
Direct PCR sequencing extended the sequence 3' to the end of 
the C2 domain (nt 7053) . The C2 domain sequence was unique 
except at nt 7045 near the 3' end of the C2 domain. Analysis 
of repeated PCR reactions yielded either A, G or a double read 
of A/G at this site. 



Sequencing was extended into the 3 1 UTR using two 
additional primers, SEQ ID N0:31 5 ' -GGA TCC ACC CCA CGA GCT 
GG-3' (nt 6977-6996) and SEQ ID NO:32 5 ' -CGC CCT GAG GCT CGA 
GGT TCT AGG - 3 1 (nt 7008-7031). Approximately 15 bp of 3' UTR 
15 sequence were obtained, although the sequence was unclear at 

several sites. Several antisense. primers then were synthesized 
based on the best estimates of the 3 1 untranslated sequence. 
These primers included the reverse complement of the TGA stop 
codon at their 3 ! termini. PCR products were obtained from 

2 0 both porcine spleen genomic DNA and porcine spleen cDNA that 

were visualized by agarose gel electrophoresis and ethidium 
bromide staining using a specific sense primer SEQ ID NO: 33 
5 1 -AAT CAG GAC TCC TCC ACC CCC G-3 ' (nt 6913-6934) and the 3' 
UTR antisense primer, SEQ ID NO: 34 5 1 - CCTTGCAGGAATTCGATTCA- 3 1 , 
25 To obtain sufficient quantities of material for cloning 

purposes, a second round of PCR was done using a nested sense 
primer,, SEQ ID NO:35 5 ' -CCGTGGTGAACGCTCTGGACC-3 1 (nt 
6932-6952) and the same antisense primer. The 141 bp PCR 
product was cloned into EcoRV-cut pBluescript II KS- . 

3 0 Sequence of three clones derived from genomic DNA and three 

clones derived from cDNA was obtained in both directions. The 
sequence was unambiguous except at nt 7045, where genomic DNA 
was always A and cDNA was always G. 

35 
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Multiple DNA sequence alignments of human, porcine, and 
mouse fVIII (Fia. 1A-1H) . 

Alignments of the signal peptide, Al , A2 , A3, CI, and C2 
5 regions were done using the CLUSTALW program [Thompson, J.D. 

et al . (1994) Nucleic. Acids. Res. 22 : 4673 -4680] . Gap open and 

gap extension penalties were 10 and 0.05 respectively. The 
alignments of the human, mouse, and pig B domains have been 
described previously [Elder et al . (1993) supra] . The human 

10 A2 sequence corresponds to amino acids 373-740 in SEQ ID NO:2. 

The porcine A2 amino acid sequence is given in SEQ ID NO:4, 
and the mouse A2 domain amino acid sequence is given in SEQ ID 
NO: 6, amino acids 392-759. 

15 Example 11. Expression of active, recombinant B- 

domainless porcine factor VIII (PB~ ) 

Materials 

Citrated hemophilia A and normal pooled human plasmas 

20 .were purchased from George King Biomedical, Inc. Fetal bovine 

serum, geneticin, penicillin, streptomycin, DMEM/F12 medium 
and AIM-V medium were purchased from Life Technologies, Inc. 
-Tag DNA polymerase was purchased from Promega. Vent DNA 
polymerase was purchased from New England Biolabs . Pfu DNA 

25 polymerase and the phagemid pBlueScript II KS" were purchased 

from Stratagene. Synthetic oligonucleotides were purchased 
from Life Technologies or Cruachem, Inc. Restriction enzymes 
were purchased from New England Biolabs or Promega. 5 1 - 
phosphorylated primers were used when PCR products were 

30 produced for cloning purposes. Nucleotide (nt) numbering of 

oligonucleotides used as primers for polymerase chain reaction 
(PCR) amplification of porcine fVIII cDNA or genomic DNA uses 
the human fVIII cDNA as reference [Wood et al . (1984) Nature 
312 : 330-337] . A fVIII expression vector, designated HB"/ReNeo, 

35 was obtained from Biogen, Inc. HB"/ReNeo contains ampicillin 

and geneticin resistance genes and a human fVIII cDNA that 
lacks the entire B domain, defined as the Ser741-Argl648 
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cleavage .fragment produced by thrombin. To simplify 
mutagenesis of fVHI C2 domain cDNA, which is at the 3' end of 
the f VI 1 1 insert in ReNeo , a Not I site was introduced two 
bases 3 ' to the stop codon of HB~/ReNeo by splicing-by-overlap 
extension (SOE) mutagenesis [Horton, R.M., et al . (1993) 
Methods Enzymol . 217:270-279] . This construct is designated 
HB'ReNeo/ Not J. 

Total RNA was isolated by acid guanidinium thiocyanate- 
phenol- chloroform extraction [Chomczynski , P. et al . (1987) 
Anal. Biochem. 162. ; 156 - 159] . cDNA was synthesized from mRNA 
using Moloney murine leukemia virus reverse transcriptase (RT) 
and random hexamers according to instructions supplied by the 
manufacturer (First-Strand cDNA Synthesis Kit , Pharmacia 
Biotech) . Plasmid DNA was purified using a Qiagen Plasmid Maxi 
Kit (Qiagen, Inc.). PCR reactions were done using a Hybaid 
OmniGene thermocycler using Tag, Vent, or Pfu DNA polymerases. 
PCR products were gel purified, precipitated with ethanol, and 
ligated into plasmid DNA using T4 DNA ligase (Rapid DNA 
ligation kit, Boehringer Mannheim) . Insert-containing plasmids 
were used to transform E. coll Epicurean XLl-Blue cells. All 
novel fVIII DNA sequences generated by PCR were confirmed by 
dideoxy sequencing using an Applied Biosystems 373a automated 
DNA sequencer and the PRISM dye terminator kit. 

Construc tion of a hybrid fVIII expression vector, HP20. 
containing the porcine C2 domain. 

A porcine fVIII cDNA corresponding to the 3' end of the 
CI domain and all of the C2 domain was cloned into pBluescript 
by RT-PCR from spleen total RNA using primers based on known 
porcine fVIII cDNA sequence [Healy, J.F. et al . (1996) Blood 
88 : 4209-4214] . This construct and HB"/ReNeo were used as 
templates to construct a human Cl-porcine C2 fusion product in 
pBlueScript by SOE mutagenesis.- The C1-C2 fragment in this 
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plasmid was removed with ApaJ and NotI and ligated into 
ApaJ/NotJ-cut HB"/ReNeo/ NotI to produce HP2 0 / ReNeo/No t J . 

Construction of B-domain deleted hybrid human/porcine 
fVIII containing the porcine light chain (HP18) - 

The human fVIII light chain consists of amino acid 
residues Aspl649-Tyr2332 . The corresponding residues in the 
porcine fVIII cDNA were substituted for this region of HB" to 
produce a hybrid human/porcine fVIII molecule designated HP18. 
This was done by substituting a PCR product corresponding to 
porcine A2 region, the A3 domain, the CI domain, and part of 
the C2 domain for the corresponding region in HP2 0. To 
facilitate constructions, a synonymous AvrJJ site was 
introduced into nt 2273 at the junction of the A2 and A3 
domains of HP20 by SOE mutagenesis. 

Construction of B-domain deleted hybrid human/porcine 
fVIII containing the porcine signal peptide, Al domain 
and A2 domain (HP22)- 

The human fVIII signal peptide, Al domain and A2 domains 
consist of amino acid residues Met ( -19 ) -Arg740 . The 
corresponding residues in the porcine fVIII cDNA were 
substituted for this region of HB" to produce a molecule 
designated HP22 . Additionally, a synonymous AvrJJ site .was 
introduced into nt 2273 at the junction of the A2 and A3 
domains of HP22 by SOE mutagenesis. HP22 was constructed by 
fusion of a porcine signal peptide-Al -partial A2 fragment in 
pBlueScript [Healy et al . (1996) supra] with a B-domainless 
hybrid human/porcine fVIII containing the porcine A2 domain, 
designated HP1 [Lubin et al . (1994) supra] . 

Construction of porcine B domainless fVIII-(PB") 
A Spel/NotI fragment of HP18/BS (-k AvrJJ) was digested 
with Avrll/NotI and ligated into Avr II /Not J-digested HP22/BS 
(+ AvrJJ) to produce a construct PB"/BS ( + AvrJJ) , which 
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consists of the porcine, f VIII lacking the entire B domain. 
PB- was cloned into.ReNeo by ligating an Xba/NotI fragment of 
PB'/BS-'(+ Avrll) into HP22/ReNeo/ NotI (+ Avrll) . 

Expression of recombinant fVIII molecules 

PB~/ReNeo/I\fotX ( + Avrll) and HP22 /ReNeo/Not I (+AvrII) 
were transiently transfected into COS cells and expressed as 
described previously [Lubin, i\m. et al . (1994) J. Biol. Chem. 

2^9:8639-8641]. HB~/ReNeo/Not I and no DNA (mock) were 
transfected as a control. 

The fVIII activity of PB", HP22, and HB" were measured by 
a chromogenic assay as follows. Samples of fVIII in COS cell 
culture supernatants were activated by 4 0 nM thrombin in a 
0.15 M NaCl, 20 mM HEPES, 5Mm CAC12, 0.01% Tween-80, pH 7 . 4 in 
the presence of 10 nM factor IXa, 425 nM factor X, and 50 
unilamellar phosphatidylserine- [phosphatidycholine (25/75 w/w) 
vesicles. After 5 min, the reaction was stopped with 0.05 M 
EDTA and 10 0 nM recombinant desulf atohirudin and the resultant 
factor Xa was measured by chromogenic substrate assay . In the 
chromogenic substrate assay, 0.4 mM Spectrozyme Xa was added 
and the rate of para-nitroanilide release was measured by 
measuring the absorbance of the solution at 4 05 nm. , 

Results of independently transfected duplicate cell 

culture supernatants (absorbance at 405 nm per minute) 

HB": 13.9 
PB": 139 
HP22: 100 
mock: <0.2 

These results indicate that porcine B-domainless fVIII 
and a B-domainless fVIII consisting of the porcine Al and A2 
subunits are active and suggest that they have superior 
activity to human B-domainless fVIII. 
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PB" was partially purified and concentrated from the 
growth medium by heparin- Sepharose chromatography. Heparin- 
Sepharose (10 ml) was equilibrated with 0.075 M NaCl , 10 mM 
HEPES, 2.5 mM CaCl 2 , 0.005% Tween-80, 0.02% sodium azide, pH 
5 7.40. Medium (100-200 ml) from expressing cells was applied to 

the heparin-Sepharose , which then was washed with 30 ml of 
equilibration buffer without sodium azide. PB" was eluted with 
0.65 M NaCl, 20 mM HEPES, 5 mM CaCl 2/ 0.01% Tween-80, pH 7.40 
and was stored at -80 °C. The yield of fVIII coagulant 
10 activity was typically 50-75%. 

Stable expression of porcine B-domainless fVIII (PB") 
Transfected cell lines were maintained in Dulbecco 1 s 
modified Eagle's medium-F12 containing 10% fetal bovine serum, 
15 50 U/ml penicillin, 50 (xg/ml streptomycin. Fetal bovine serum 

was, heat inactivated at 50°C for one hour before use. HB" 
/ReNeo and PB'ReNeo/ItfotT (+ Avrll) were stably transfecfed 
into BHK cells and selected for geneticin resistance using a 
< : general protocol that has been described previously [Lubin et 
20 al. (1994) Biol. Chem. 269:8639-8641] except that expressing 

" cells were maintained in growth medium containing 600 /xg/ml 
' geneticin. Cells from Corning T- 75 flasks grown to confluence 
were transferred to Nunc triple flasks in medium containing 
600 /ig/ml geneticin and grown to confluence. The medium-was 
25 removed and replaced with serum- free, AIM-V medium (Life 

Technologies, Inc.) without genet icin . Factor VIII expression 
was monitored by one-stage factor VIII coagulant activity 
(vide supra) and 100-150 ml of medium was collected once daily 
for four to five days. Maximum expression levels in medium 
30 for HB" and PB" were 1-2 units per ml and 10-12 units per ml of 

factor VIII coagulant activity, respectively. 

Purification of PB" 

PB" was precipitated from culture supernatant using 60% 
35 saturated ammonium sulfate and then purified by W3-3 

immunoaf f inity chromatography and mono Q high pressure liquid 
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chromatography as described previously for- the purification of 
plasma -derived porcine f actor VIII [Lollar et al . (1993) 
.Factor Vlll/f actor Villa. Methods Enzymol . 222: 128-143] The 
specific coagulant activity of PB" was measured by a one-stage 
coagulation assay [Lollar et al . (1993) supra] and was similar 
to plasma-derived porcine factor VIII. 

When analyzed by SDS-polyacrylamide gel electrophoresis, 
the PB~ preparation contained three bands of apparent 
molecular masses 160 kDa, 82 kDa, and 76 kDa . The 82 kDa and 
76 kDa bands have been previously described as heterodimer 
containing the A1-A2 and ap-A3-Cl-C2 domains (where ap refers 
to an activation peptide) [Toole et al. (1984) Nature 312 : 342 - 
347] . The 160 kDa band was transferred to a polyvinylidene 
fluoride membrane and subjected to NH2 -terminal sequencing, 
which yielded Arg- Ile-Xx-Xx-Tyr . (where Xx represents 
undermined) which is the NH2 -terminal sequence of single chain 
factor VIII [Toole et al . (1984) supra] . Thus, PB" is 

partially processed by cleavage between the A2 and A3 domains, 
such that it consists of two forms, a single chain Al-A2-ap- 
A3-C1-C2 protein and a Al-A2/ap-A3 -C1-C2 heterodimer. Similar 
processing of recombinant HB~ has been reported [Lind et al . 
(1995) Eur. J. Biochem. 232:19-27]. 

Characterization of Porcine factor VIII 

We ' have determined the cDNA sequence of porcine fVIII 
corresponding to 137 bp of the 5 ! UTR, the signal peptide 
coding region (57 bp) , and the Al (1119 bp) , A3 (990 bp) , CI 

(456 bp), and C2 (483 bp) domains. Along with previously 
published sequence of the B domain and light chain activation 
peptide regions [Toole et al. (1986) supral and the A2 domain 

[Lubin et al . (1994) supra], the sequence reported here 

completes the determination of the porcine fVIII cDNA 
corresponding to the translated product . - A fragment that 
included the 5' UTR region, signal peptide,* and Al domain cDNA 
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was cloned using a 5 1 -RACE RT-PCR protocol. A primer based on 
human C2 sequence was successful in producing an RT-PCR 
product that led to cloning of the A3, CI, and .5 ' half of the 
C2 domain. The cDNA corresponding to the 3' half of the C2 
5 domain and 3 1 UTR cDNA proved difficult to clone. The 

remainder of the C2 domain ultimately was cloned by a targeted 
gene walking PCR procedure [Parker et al . (1991) supra]. 

The sequence reported herein SEQ ID NO: 36 was unambiguous 
except at nt 7045 near the 3 ! end of the C2 domain, which is 
either A or G as described hereinabove. The corresponding 
codon is GAC (Asp) or AAC (Asn) . The human and mouse codons 
are GAC and CAG (Gin) , respectively. Whether this represents 
a polymorphism or a reproducible PCR artifact is unknown. 
Recombinant hybrid human/porcine B-domainless fVIII cDNAs 
containing porcine C2 domain substitutions corresponding to 
both the GAC and AAC codons have been stably expressed with no 
detectable difference in procoagulant activity. This indicates 
that there is not a functional difference between these two C2 
domain variants . 

The alignment of the predicted amino acid sequence of 
full-length porcine fVIII SEQ ID NO:37 with the published 
human [Wood et al . (1984) supra] and murine [Elder et al . 
(1993) supra] sequences is shown in Fig. 1A-1H along with 
sites for post-translational modification, proteolytic 
cleavage, and recognition by other macromolecules . The degree 
of identity of the aligned sequences is shown in Table VII. 
As noted previously, the B domains of these species are more 
divergent than the A or C domains. This is consistent with 
the observation that the B domain has no known function, 
despite its large size [Elder et al . (1993) supra; Toole et 
al. (19>a6> supra}. The results of the present invention 
confirm that the B domain or porcine fVIII is not necessary 
for activity. Based on the sequence data presented herein, 
porcine fVIII having all or part of the B-domain deleted can 
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be synthesized by expressing the porcine fVIII coding DNA 
. having deleted, therefrom all or part of codons of the porcine 
B domain. There is also more divergence of sequences 
corresponding to the Al domain APC/f actor IXa cleavage peptide 
(residues 337-372) and the light chain activation peptide 
(Table VII) . The thrombin cleavage site at position 336 to 
generate the 337-372 peptide is apparently lost in the mouse 
since this residue is glutamine instead of arginine [Elder et 
al . (1993) supra] . The relatively rapid divergence of thrombin 
cleavage peptides (or in mouse fVIII a possibly vestigial 337- 
3 72 activation peptide) has been previously noted for the 
f ibrinopeptides [Creighton, T. E. (1993) In Proteins : 
Structures and Molecular Properties . W.H. Freeman, New York, 
pp. 105-13 8] . Lack of biological function of these peptides 
once cleaved has been cited as a possible reason for the rapid 
divergence . Arg5 6 2 in human fVIII has been proposed to be the 
more important cleavage site for activated protein C during 
the inactivation of fVIII and fVIIIa [Fay, P.J. et al . (1991) 
J. Biol. Chem. 266 : 20139-201451 . This site is conserved in 
human, porcine and mouse fVIII. 

Potential N- linked glycosylation sites are also shown in 
bold in Fig. 1A-1H. There are eight conserved N- linked 
glycosylation sites: one in the Al domain, one in the A2 
domain, four in the B domain, one in the A3 domain, and one in 
the CI domain. The 19 A and C domain cysteines are conserved, 
whereas there is divergence of B domain cysteines. Six of the 
seven disulfide linkages in fVIII are found at homologous 
sites in factor V and ceruloplasmin, and both C domain 
disulfide linkages are found in factor V [McMullen, B . A. et 
al. (1995.) Protein Sci. 4:740-746]. Human fVIII contains 
sulfated tyrosines at positions 346, 718, 719, 723, 1664, and 
1680 [Pittman, D.D. et al . (1992) Biochemistry 3JL : 3315-3325 ; 
Michnick, D.A. et al. (1994) J\ Biol. Chem. 269.: 20095-20102] . 
-These residues are conserved in mouse fVIII and porcine fVIII 
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(Fig. 1) , although the CLUSTALW program failed to align the 
mouse tyrosine corresponding to Tyr346 in human ~f VII I. 

Mouse and pig plasma can correct the clotting defect in 
human hemophilia A plasma, which is consistent with the level 
of conservation of residues in the A and C domains of these 
species. The procoagulant activity of porcine fVIII is 
superior to that of human fVIII [Lollar, P. et al . (1992) J . 
Biol. Cheru. 267 : 23652 - 23657] . The recombinant porcine factor 
VIII (B domain-deleted) expressed and purified as herein 
described also displays greater specific coagulant activity 
than human fVIII, being comparable to plasma-derived porcine 
fVIII. This may be due to a decreased spontaneous 

dissociation rate of the A2 subunit from the active A1/A2/A3- 
C1-C2 fVIIIa heterotrimer. Whether this difference in 
procoagulant activity reflects an evolutionary change in 
function as an example of species adaptation [Perutz, M.F. 
(1996) Adv. Protein Chem. 36.*- 213 -244] is unknown. Now that the 
porcine fVIII cDNA sequence corresponding to the translated 
product is complete, homolog scanning mutagenesis [Cunningham, 
B.C., et al. (1989) Science 243: 1330-1336] may provide a way 
to identify structural differences between human and porcine 
fVIII that are responsible for the superior activity of * the 
latter. 

Porcine fVIII is typically less reactive with inhibitory 
antibodies that arise in hemophiliacs who have been transfused 
with fVIII or which arise as autoantibodies in the general 
population. This is the basis for using porcine fVIII 
concentrate in the management of patients with inhibitory 
antibodies [Hay and Lozier (1995) supra] . Most inhibitors are 
directed against epitopes located' in the A2 domain or C2 
domain [Fulcher, C.A. et al . (1985) Proc. Natl. Acad. Sci . USA 
82:7728-7732; Scandella , ' D . et al . (1988) Proc. Natl. Acad. 
Sci. USA £5:6152-6156 ; Scandella, D. et al . (1989) Blood 
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74: 1618-1626] . Additionally, an epitope of unknown 

significance has been identified that is in either the A3 or 
CI domain [Scandella et al . (1989) supra; Scandella, D. et al . 
(1993) Blood 82:1767-1775; Nakai, H. et al / (1994) Blood 
84:: 224a] The A2 epitope has been mapped to residues 484-508 
by homo log scanning mutagenesis [Healey et al . (1995) supra] . 
In this 25 residue segment, there is relatively low proportion 
of identical sequence (16/25 or 64%) . It is interesting that 
this region, which appears to be functionally important based 
on the fact that antibodies to it are inhibitory, apparently 
has been subjected to relatively more rapid genetic drift. 
Alignment of the porcine A2 domain and A3 domains indicate 
that the A2 epitope shares no detectable homology with the 
corresponding region in the A3 domain. 

The C2 inhibitor epitope of human fVIII has been proposed 
to be located to within residues 2248-2312 by deletion mapping 
[Scandella, D. et al . (1995) Blood 86.: 1811-1819] . Human and 
porcine fVIII are 83% identical in this 65 residue segment. 
However, homolog scanning mutagenesis of this region to 
characterize the C2 epitope has revealed that a major 
determinant of the C2 epitope was unexpectedly located in the 
region corresponding to human amino acids 2181-2243 (SEQ ID 
NO: 2) and Fig. 1H. 

Human-porcine hybrid factor VIII proteins were made in 
which various portions of the C2 domain of human factor VIII 
were replaced by the corresponding portions of porcine factor 
VIII, using the strategy herein described. (Example 8) The 
synthesis of the various C2 -hybrid factor VIIIs was 
accomplished by constructing hybrid coding DNA, using the 
nucleotide sequence encoding the porcine C2 region given in 
SEQ ID NO ..37. Each hybri.d DNA was expressed in traasfected 
cells, such that the hybrid factor VIIIs could be partially 
purified from the growth medium. Activity, in the absence of 
any inhibitor, was measured by the one-stage clotting assay. 
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A battery of five human inhibitors was used to test each 
hybrid factor VIII. The inhibitor plasmas containing anti 
factor VIII antibody had been previously shown- to be directed 
against human C2 domain, based on the ability of recombinant 
human C2 domain to neutralize the inhibition. In all the test 
plasmas, the inhibitor titer was neutralized greater than 79% 
by C2 domain or light chain but less than 10% by recombinant 
human A2 domain. In addition the C2 -hybrid factor VIIIs were 
tested against a murine monoclonal antibody, which binds the 
C2 domain, and like human C2 inhibitor antibodies, it 
inhibited the binding of factor VIII to phospholipid and to 
von Willebrand factor. 

By comparing the antibody inhibitor titers against the 
C2- hybrid factor VIIIs, the major determinant of the human C2 
inhibitor epitope was shown to be the region of residues 2181- 
2243 (SEQ ID NO : 2 , see also Fig. 1H) . Anti-C2 antibodies 
directed to a region COOH- terminal to residue 2253 were not 
identified in four of the five patient sera. In comparing 
hybrids having porcine sequence corresponding to human amino 
acid residues numbers 2181-2199 and 2207-2243, it was apparent 
that both regions contribute to antibody binding. The porcine 
amino acid sequence corresponding to human residues 2181-2243 
is numbered 1982-2044 in SEQ ID NO:37. The sequence of 
porcine DNA encoding porcine amino acids numbered 1982-2044 is 
nucleotides numbered 5944-6132 in SEQ ID NO: 35. 

Referring to Fig. 1H, it can be seen that in the region 
2181-2243, there are 16 amino acid differences between the 
human and porcine sequences. The differences are found at 
residues 2181, 2182, 2188, 2195-2197, 2199, 2207, 2216, 2222, 
2224-2227, 2234, 2238 and 2243. Amino acid replacement at one 
or more of these numbered residues can be carried out to make 
a modified human factor VIII non-reactive to human anti-C2 
inhibitor antibodies. Alanine scanning mutagenesis provides 
a convenient method for generating alanine substitutions for 
naturally-occurring residues, as previously described. Amino 
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acids other than alanine can be substituted as well, as 
described .herein . . Alanine substitutions for individual amino 
acids, .especially those which are non- identical between 
human/porcine or . human/mouse or which are most likely to 
contribute to antibody binding, can yield a modified factor 
.VIII with reduced reactivity to inhibitory antibodies. 

In addition, the strategy of inserting amino, acids with 
lower potential to be immunogenic in the defined region of 
residues 2181-2243 yields modified factor VIIls having reduced 
immunogenicity. Reduced immunogenicity factor VIII is useful 
as a . factor VIII supplement for treatment of hemophilia A 
patients in preference to natural- sequence factor VIII. 
Patients . treated with reduced immunogenicity factor VIII are 
less likely, to develop inhibitory antibodies, and are 
therefore less likely to suffer from reduced effectiveness of 
treatment over their lifetimes. 

Figures 1A-1H taken together provide an aligned sequence 
comparison of the human, pig and mouse factor VIII amino acid 
sequences.* Fig. 1A compares signal peptide regions (human, 
SEQ ID NO:40; porcine, SEQ ID NO:37, amino acids 1-19; murine, 
SEQ ID NO: 6, amino acids 1-19) . Note that the amino acids in 
Fig. 1A-1H are numbered at the first Alanine of the mature 
protein as number 1, with amino acids of the signal peptide 
assigned negative numbers. The Human fVIII sequence in SEQ ID 
NO: 2 also begins with the first Alanine of the mature protein 
as amino acid number 1 . In the amino acid sequences of mouse 
fVIII (SEQ ID NO: 6) and porcine fVIII (SEQ ID No:37), the 
first amino acid (alanine) of the mature sequence is amino 
acid number '20. Fig. " 1A-1H shows an alignment of the 
corresponding sequences of human, mouse and pig fVIII, such 
that - the regions of ' greatest amino acid identity are 
juxtaposed. " The amino acid numbers in Fig. 1A-1H apply to 
human fVIII only. Fig. IB gives the amino acid sequences for 
the Al domain, of- human (SEQ ID NO:2, amino acids 1-372), 
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porcine (SEQ ID NO: 37, amino acids 20-391) , and murine (SEQ ID 
NO:6, amino acids 20-391). Fig. 1C provides amino acid 
sequences for the Factor VIII A2 domains from human (SEQ ID 
NO:2, amino acids 373-740), pig (SEQ ID NO:37, amino acids 
5 392-759) and mouse (SEQ ID NO:6, amino acids 392-759) . Fig. 

ID provides the amino acid sequences of B domains of human 
factor VIII (SEQ ID NO:2, amino acids 741-1648), pig (SEQ ID 
NO:37, amino acids 760-1449) and mouse (SEQ ID NO : 6 , amino 
acids 760-1640) . Fig. IE compares the amino acid sequences of 

10 Factor VIII light chain activation peptides of human, pig and 

mouse (SEQ ID NO:2, amino acids 1649-1689; SEQ ID NO:37, amino 
acids 1450-1490; and SEQ ID NO : 6 , amino acids 1641-1678, 
respectively) . Fig. IF provides the sequence comparison for 
human, pig and mouse Factor VIII A3 domains (SEQ ID NO: 2, 

15 amino acids 1690-2019; SEQ ID NO:37, amino acids 1491-1820; 

and SEQ ID NO : 6 , amino acids 1679-2006, respectively. Fig. 1G 
provides the amino acid sequences of the Factor VIII: CI 
domains of human, pig and mouse (SEQ ID NO: 2, amino acids 
2020-2172; SEQ ID NO: 37, amino acids 1821-1973; and SEQ ID 

20 NO:6, amino acids 2007-2159, respectively). Fig. 1H provides 

-v.* sequence data for the C2 domains of the Factor VIII C2 domains 
^ L.. of human, pig and mouse (SEQ ID NO: 2, amino acids 2173-2332; 
SEQ ID NO: 37, amino acids 1974-213 3; and SEQ ID NO: 6, amino 
acids 2160-2319, respectively) . 

25 * 

. The diamonds represent tyrosine sulfation sites, 
potential glycosylation sites are in bold type, proposed 
binding sites for Factor IXa, phospholipid and Protein C are 
double -underlined, and regions involved in binding anti-A2 and 

30 anti-C2 inhibitory antibodies are italicized. Asterisks 

highlight amino acid sequences which are conserved. See also 
SEQ ID NO: 36 (porcine factor VIII cDNA) and SEQ ID NO: 37 
(deduced amino acid sequence of porcine factor VIII) . The 
human numbering system is used as the reference [Wood et al ; 

35 (19 84) supra] . The Al , A2 , and B domains are defined by 

thrombin- cleavage sites at positions 372 and 740 and an 

99 



BNSDOCID: <WO S946274A1 J_> 



WO 99/46274 



PCT/US99/05193 



unknown protease cleavage site at 1648 as residues 1-372, 373- 
740, and 741-1648, respectively [Eaton, D.L. et al . (1986) 
Biochemistry 25:8343-8347] . The A3 , CI , . and C2 domains are 
defined as residues 1690-2019, 2020-2172, and 2173-2332, 
respectively [Vehar et al . (1984) supra] . Cleavage sites for 
thrombin (factor Ila) , factor IXa, factor Xa and APC [Fay et 
al. (1991) supra; Eaton, D. et al . (1986) Biochemistry 
25:505-512; Lamphear, B.J. et al . (1992) Blood 80:3120-31281 
are shown by placing the enzyme name over the reactive 
arginine . An acidic peptide is cleaved from the fVIII light 
chain by thrombin or factor Xa at position 1689. Proposed 
binding sites for factor IXa [Fay, P.J. et al . (1994) J. Biol. 
Chem. 269 : 20522-20527 ; Lenting, P.J. et al . (1994) J . Biol. 
Chem. 269 : 7150-7155) , phospholipid (Foster, P. A. et al . (1990) 
Blood 75:1999-2004) and protein C (Walker, F.J. et al . (1990) 
J". Biol. Chem. 265.: 1484-1489] are doubly underlined. Regions 
involved in binding anti-A2 [Lubin et al . (1994) supra; Healey 
et al. (1995) supra] ; and previously proposed for anti-C2 
inhibitory antibodies are italicized. The C2 inhibitor 
epitope identified as herein described (human amino acids 
2181-2243) is shown by a single underline in Fig. 1H. Tyrosine 
sulfation sites [Pittman et al . (1992) supra; Michnick et al . 
(1994) supra] are shown by ♦. Recognition sequences for 
potential' N-linked glycosylation (NXS/T, where X is not 
proline) are shown in bold. 

The nucleotide sequence encoding the factor VIII protein 
lacking the B domain is given in SEQ ID NO: 38, and the 
corresponding deduced amino acid sequence is provided in SEQ 
ID NO': 39. ' : •' - - 
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CLAIMS 

1. Isolated and purified DNA comprising a nucleotide 
sequence encoding the amino acid sequence of porcine 
factor VIII set forth in SEQ ID NO: 37. 

2. DNA of claim 1 comprising the nucleotide sequence set 
forth in SEQ ID NO: 36. 

3 . Isolated and purified DNA comprising a nucleotide 

sequence encoding the Al domain of porcine factor VIII as 
set forth in SEQ ID NO: 37 from amino acids 20-391. 

4 . DNA of claim 3 comprising the nucleotide sequence set 

forth in SEQ ID NO: 36 from positions 58-1173. 

5. Isolated and purified DNA comprising a nucleotide 
sequence encoding the A3 domain of porcine factor VIII as 
set forth in SEQ ID NO: 37 from amino acids 1491-1820. 

6 . DNA of claim 5 comprising the nucleotide sequence ^set 
i.7i forth in SEQ ID NO: 36 from positions 4471-5460. 

7. Isolated and purified DNA comprising a nucleotide 
sequence encoding the CI domain of porcine factor VIII as 
set forth in SEQ ID NO:37 from amino acids 1821-1973. 

8 . DNA of claim 7 comprising the nucleotide sequence set 
forth in SEQ ID NO:36 from positions 5461-5919. 

9.. Isolated and purified DNA comprising a nucleotide^ 
sequence encoding the C2 domain of porcine factor VIII as 
set forth in SEQ ID NO: 37 from amino acids 1974-2133. 

10. DNA of claim 9 comprising the nucleotide sequence set 
forth in SEQ ID NO: 36 from positions 5920-6399. 
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11. DNA of claim 1 wherein said nucleotide sequence encodes 
the amino, acid- sequence set forth in SEQ ID No: 39. 

12. - DNA of claim 11 comprising the nucleotide sequence set 

forth in SEQ ID No:38. 

13. DNA encoding human/porcine hybrid factor VIII comprising 
a substitution of porcine factor VI.II coding DNA wherein 
nucleotide sequence encoding amino acids number 2181-2243 
of human factor VIII is substituted by nucleotides 
encoding amino acids 1982-2044 of SEQ ID NO: 37. 

14. DNA according to claim 13 wherein the porcine factor VIII 
coding DNA is nucleotides 5944-613 2 of SEQ ID NO: 36. 

15. DNA encoding porcine, factor VIII comprising codons 
encoding amino acids 20-2133 of SEQ ID NO: 37. 

16. DNA according to claim 15 having the sequence of 
nucleotides 58-6399 of SEQ ID NO: 36. 

17. DNA encoding B-domainless porcine factor VIII comprising 
codons encoding amino acids 20-1443 of SEQ ID NO: 39. 

18. DNA according to claim ..17 having the sequence of 
nucleotides 60-4331 of SEQ . ID NO: 38. 

19. : Isolated and purified protein expression product of the 

DNA of claim 17. 

20. Isolated and purified protein expression product of the 
DNA of claim 18 . , 

21. Modified human factor VIII comprising an amino acid 
- . substitution at one or more of position 2181-2243 

according to. SEQ ID N0:2, said substitution being 
insertion of an innnunoreactivity-reducing amino acid for 
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the naturally-occurring amino acid, said modified factor 
VIII having procoagulant activity. 



22. The modified factor VIII of claim 21 wherein the 
substitution is made at one or more of position 2181, 
2182, 2195, 2196, 2197, 2199, 2207, 2216, 2222, 2224, 
2225, 2226, 2227, 2228, 2234, 2238, or 2243 according to 
SEQ ID NO : 2 . 

23. The modified factor VIII of claim 21 wherein the 
substitution is made at one or more of position 2181, 
2195, 2196, 2207, 2226, 2227, 2228, and 2234. 

24. A method for modifying a factor VIII such that reactivity 
to an inhibitory antibody is reduced and procoagulant 
activity is retained, comprising substituting an 
immunoreactivity- reducing amino acid for the naturally - 
occurring amino acid at at least one position within 
amino acids number 2181-2243 according to SEQ ID NO : 2 . 

25. The method of claim 24 wherein an amino acid is 
•-»■ substituted within amino acids number 2181-2243 according 

to SEQ ID NO:2, by expressing DNA encoding a modified 
factor VIII, the DNA having at least one codon modified 
to encode the amino acid substitution within said amino 
acids number 2181-2243. 



26. The method of claim 24 wherein the immunoreactivity- 
reducing amino acid is alanine, methionine, lencine, 
serine or glycine. 

27. The method of claim 26 wherein the immunoreactivity 
reducing amino acid is alanine. 

28. The method of claim 24 wherein 1 substitution of an 
immunoreactivity-reducing amino acid is made at at least 
one of positions number 2181, 2182, 2195, 2196, 2197, 
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2199, 2207, 2216, 2222, 2224, 2225, 2226, 2227, 2228, 
2234, 2238 or 2243 according to SEQ ID NO:2. 

29. A method of making porcine factor VIII comprising 
expressing DNA encoding the amino acid sequence set forth 
in SEQ ID NO: 37 including at least amino acids 20-2133 in 
a suitable mammalian host cell in a culture medium and 
purifying the factor VIII protein from said cell or from 
said culture medium. 

30. The method of claim 29 wherein the DNA has a nucleotide 
sequence essentially as set forth in SEQ ID NO: 36, 
including at least nucleotides 58-6399. 

31. The method of claim 29 wherein the DNA encodes the amino 
acid sequence set forth in SEQ ID NO: 37/ 

32. The method of claim 31 wherein the DNA has the nucleotide 
sequence set forth in SEQ ID NO: 36* 

33. A method of making B-domainless porcine factor VIII 
comprising expressing a DNA encoding the amino acid 
sequence set forth in SEQ ID NO: 39 including at least 
amino acids 2 0-1443 in a suitable mammalian host, cell in 
a culture medium and purifying the factor VIII protein 

.from said cell or from said culture medium. 

34. . The method of claim 33 wherein the DNA has a nucleotide 

sequence essentially as set forth in SEQ ID NO: 38, 
including at least nucleotides 60-4331. 

35. The method of claim 33 wherein the DNA encodes the amino 
acid sequence set forth in SEQ ID N0:39. 

36. The method of claim 33 wherein the DNA has the nucleotide 
sequence set forth in SEQ ID NO: 38 from nucleotide number 
3 through nucleotide number 4331. 
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Signal peptide 

Human -19 MQIELSTCFF LCLLRFCFS 

P1g MQLELSTCVF LCLLPLGFS FIG. 1A 

House HQIALFACFF LSLFNFCSS 



Al domain 

Human 1 ATRRYYLGAV ELSWOYHQSD LG-ELPVDAR FPPRVPKSFP FNTSVVYKKT 

Pig AIRRYYLGAV ELSWDYRQSE LLRELHVDTR FPATAPGALP LGPSVLYKKT* - D 

House AIRRYYLGAV ELSWNYIQSO LLSVLHTDSR FLPRMSTSFP FNTSIMYKKT r,,J - 1 B 

*★ A AAA A *** ★ *** * ** ★ * * * * * * 



50 LFVEFTOHLF NIAKPRPPWH GLLGPTIQAE VYDTVVITLK NHASHPVSLH 

VFVEFTDQLF SVARPRPPWH GLLGPTIQAE VYDTWVTLK NMASHPVSLH 

VFVEYKDQLF NIAKPRPPWH GLLGPTIWTE VHDTVVITLK NHASHPVSLH 
*** * ** * ****** ******* * * **** *** ********** 

100 AVGVSYWKAS EGAEYDDQTS QREKEDDKVF PGGSHTYVWQ VLKENGPHAS 
AVGVSFWKSS EGAEYEDHTS QREKEDDKVL PGKSQTYVWQ VLKENGPTAS 
AVGVSYWKAS EGDEYEDQTS QHEKEDOKVF PGESHTYVWQ VLKENGPHAS 

**"*** ** * -A-*-*--* ** <• ******* ** + -A-* 

150 DPLCLTYSYL SHVDLVKDLN SGLIGALLVC REGSLAKEKT QTLHKFILLF 
DPPCLTYSYL SHVDLVKDLN SGLIGALLVC REGSLTRERT QNLHEFVLLF 
DPPCLTYSYM SHVOLVKDLN SGLIGALLVC KEGSLSKERT QHLYQFVLLF 

Ar******** ********** ********** **** * * * * * *** 

200 AVFDEGKSWH SETKNSLHQD RDAASARAWP KHHTVNGYVN RSLPGLIGCH 

AVFDEGKSWH SARNDSWTRA HDPAPARAQP AHHTVNGYVN RSLPGLIGCH 

AVFDEGKSWH SETNDSYTQS HDSASARDWP KHHTVNGYVN RSLPGLIGCH 
********** * * * * ** * ********* ********** 

250 RKSVYWHVIG HGTTPEVHSI FLEGHTFLVR NHRQASLEIS PITFLTAQTL 

KKSVYWHVIG HGTSPEVHSI FLEGHTFLVR HHRQASLEIS PLTFLTAQTF 

RKSYYWHV I G MGTTPEIHSI FLEGHTFFVR NHRQASLEIS PITFLTAQTL 
********* *** ** *** ******* ** ********* ********* 

APC/IXa ♦ 
300 LHDLGQFLLF CHISSHQHDG HEAYVKVDSC PEEPQLRMKN NEEAEDYDDD 
LHDLGQFLLF CHISSHHHGG HEAHVRYESC AEEPQLRRKA DE-EEDYDDN 
LIOLGQFLLF CHISSHKHDG HEAYVKVDSC PEESQWQKKN NN-EEHEDYD 

* ******** ****** * * *** * * ** ** * * * * 

Ila/Xa 

350 LTDSEHDVVR FDODNSPSFI QIR 
LYOSDHDVVR LDGDOVSPFI QIR 
DDLYSEHDHF TLDYDSSPFI QIR 

** *** 
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A2 domain 

Human 373 SVAKKHPKTW VHYIAAEEED WDYAPLVLAP DORSYKSQYL NNGPQRIGRK 
Pig SVAKKHPKTW VHYISAEEED WDYAPAVPSP SDRSYKSLYL NSGPQRIGRK FIG 1C 

Mouse . SVAKKYPKTW IHYISAEEED WDYAPSVPTS ONGSYKSQYL SNGPHRIGRK 
***** **** *** ***** ***** * **** ** ** ***** 

423 YKKVRFMAYT OETFKTREAI QHESGILGPL LYGEVGDTLL IIFKNQASRP 

YKKARFVAYT DVTFKTRKAI PYESGILGPL LYGEVGOTLL IIFKNKASRP 

YKKVRFIAYT DETFKTRETI QHESGLLGPL LYGEVGDTLL IIFKNQASRP 

*** ** *** * ***** * *** **** ****-****** *-***-* **** 

- A2 Inhibitor epitope 

473 YNIYPHGITD VRPLYSRRLP KGVKHLKDFP ILPGEIFK.YK. WTVTYEDGPT 
YNIYPHGITO VSALHPGRLL KGWKHLKDHP ILPGETFXYK WTVTVEDGPT 
YNIYPHGITD VSPLHARRLP RGIKHVKDLP IHPGEIFKYK HTVTYEDGPT 
********** * * ** ******** *** **** *** **** *** 

F.IXa binding 
APC 

523 KSDPRCLTRY YSSFVNHERD LASGLIGPLL ICYKE SVDOR GNO IMSDKRN 
KSDPRCLTRY YSSSINLEKD LASGLIGPLL ICYKESVOQR GNQMMSDKRN 
KSDPRCLTRY YSSFINPERO LASGLIGPLL ICYKESVOQR GNQMMSDKRN 
********** *** * * * ********** ********** *** ****** 

573 VILFSVFDEN RSWYLTENIQ RFLPNPAGVQ LEDPEFQASN IMHSINGYYF 
VILFSVFDEN QSWYLAENIQ RFLPNPDGLQ PQDPEFQASN IMHSINGYYF 
VILFSIFDEN QSWYITENMQ RFLPNAAKTQ PQOPGFQASN IMHSINGYYF 
***** **** *** ** * ***** * ***** ********** 

623 DSLQLSVCLH EVAYWYILSI GAQTDFLSVF FSGYTFKHKM VYEDTLTLFP 
DSLQLSVCLH EVAYWYILSV GAQTDFLSVF FSGYTFKHKM VYEDTLTLFP 
DSLELTVCLH EVAYWHILSV GAQTDFLSIF FSGYTFKHKM VYEDTLTLFP 
*** * **** ***** ♦** ******** * ********** ********** 

673 FSGETVFMSM ENPGLWILGC HNSDFRNRGM TALLKVSSCO KNTGDYYEDS 

FSGETVFMSM ENPGLWVLGC HNSDLRNRGM TALLKVYSCD RDIGDYYDNT 

FSGETVFMSM ENPGLWVLGC HNSOFRKRGM TALLKVSSCO KSTSDYYEEI 
********** ********** **** * *** ********** * *** 

♦ Ila/Xa/APC 

J23 YEDISAYLLS KNNAIEPR . 

YEDIPGFLLS GKNVIEPR 

YEDIPTQLVN ENNVIOPR 
**** * * * ** 
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HuST^l SFSQNSRHPS TRQKQFNATT IPENDIEKTD PWJHRTPHP "QMJSSSDL 
Pia SFAQNSRPPS ASQKQFQTIT SPEDDVE-LD PQSQERTQAL EELSVPSGDG r^- ID 

MoL SFFQNTNHPN TRKKKFKDST IPKNOMEKIE PQFEEIAEML KVQSVSVSDH 

***** ******** * * 

791 LHURQS-PT PHGLSLSOLQ EAKYETFSOO PSPGAIDSNH SLSEMTHFRP 
SMLLGQN-PA PHGSSSSOLQ EARNEA--00 YLPGARERNT APSAAARLRP 
LHLLGQSHPT PHGLFLSDGQ EAIYEAIHDO HSPNAIDSNE GPSKVTQLRP 
... • *** ********** * ** 

840 QLHHSGDHVF TPESGLQLRL NEKLGTTAAT ELKKLDFKVS ST-SNNLIS- 

ELHHSAERVl TPEP EK ELKKLDSKMS SSSDILKTSP 

ESHHSEKIVF TPQPGLQLRS NKSLETTIEV KWKKLGLQVS StPSNLMTT- 

**. * ** *** * * 

888 TIPSONLAAGT OKTSSLGPPS HPVHYOSQLD TTLFGKKSSP LTESGGPLSL 
T PSOTLSAET ERTHSLGPPH PQVNFRSQLG AIVLGKNSSH FIGASVPLGS 
TILSDNLICATF EKTOSSGFPD MPVHSSSKLS TTAF6KKAYS LVGSHVPLKA 
****** * * * * * * * ** *~* 

939 SEENNDSKLL ESGLMNSQES SWGKKYSSTE SGRLFKGKRA HGPALLTKDN 

TE £ D .._ HES SLGEHYSPVE SDGIFEKERA HGPASLTKDD 

SEENSOSNIL OSTLHYSQES LPROMLSIE NDRLLREKRF HGIALLTKDN 
»* ** * * * ** * **** 

989 ALFKVSISLL KTNKTSNNSA TNRKTHIDGP SLLIEHSPSV WQNILESDTE 
VLFKVNISLV KTNKARVYIK TNRKIHIDOA ALLTENRAS- 
TLFKDNVSLM KTNKTYNHST TNEKLHTESP TSIENSTTDL QDAIUCVNSE 

"~ 1039 FKKVTPLIHO RMLMDKNATA LRLNHHSWCT TSSKNMEMVQ QKKEGPIPPO 

ATFHOKNTTA SGLNHVSN 

IQEVTALIHO GTLLGKNSTY LRLNHMLHRT TSTKNKDIFH RKDEOPIPQO 

1089 AQNPDMSFFK MLFLPESARVf IQRTHGKNSL HSGQGPSPKQ l-VSLGPEKSV 

__ w IKGPLGKNPL SSERGPSPEL ITSSGSGKSV 

EENTIHPFSK MIFLSESSNW FKKTNGNNSl NSEQEHSPKQ LVYLMFKKYV 

* **** *** ** 

1139 EGQNFLSEKN KVVVGKGEFT KDVGLKEMVF PSSRNIFLTN LOHLHENKTH 
KGQSSGQGRI RVAVEEEELS KG — KEHHL PNSELTFLTN SADVQGHDTH 
KNQSFLSEKN KVTVEQDGFT KNIGLKOHAF PHNHSIFLTT LSNVHENGRH 



* * * * * *** * * 



1189 NQEKKIQEEI EKKETLIQEN VYLPQIHTVT GTKNFHKNLF LLSTRQNVEG 
SQGKKSREEH ERREKIVQFK VDLPQVYTAT GTKNFLRNIF HQSTEPSVEG 
NQEKNIQEEI EK-EAIIEEK VVLPQVHEAT GSKNFLKDIL ILGTRQNI- 
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1433 HFIQGAKKNN LSLAILTIEM TGOQREVGSL GTSATNSVTY KKVENTVLPK 
PILQGAKRNH LSLRFLTLEH AGGQGKISAL GKSAAGPLAS GKLEKAVLSS 
NFLKETKINN PSLAILPWNM FIDQGJCFTSP GKSNTNSVTY KKRENIIFLK 



FIG. 1D 
(cont.) 



1483 PDLPKTSGKV ELLPKVHIYQ KDIFPTETSN GSPGHLDLVE GSLLQGTEGA 
AGLSEASGXA EFLPKYRVHR EDLLPQKTSN VSCAHGOLGQ EIFLQKTRGP 
PTLPEESGKI ELLPQVSIQE EEILPTETSH GSPGHLNLMK EYFLQKIQGP 



1533 IKWNEANRPG KVPFLRVATE SSAKTPSKLL OPLAWDNHYG TQIPKEEWKS 

VNLNKVNRPG RTPSKLL --G PPMPKE-WES 

TKWNKAKRHG ESIKGKTES- -SKNTRSKLL NHHAWDYHYA AQIPKDHWKS 



1583 QEKSPEKTAF KKKDTI-LSLN ACESNHAIAA INEGQNKPEI EVTWAKQGRT 
LEKSPKSTAL RTKDIISLPLD RHESHHSIAA 1CNEGQAETQR EAAHTKQGGP 
KEKSPEIISI KQEOTI-ISLR PHGNSHSIGA -NEKQNWPQR ETTWVKQGQT 
**** * * ****** *- 



1633 ERLCSQ.NPPY LKRHQR 
GRLGAPKPPV LRRHQR 
QRTCSQIPPV LKRHQR 



*** * **** 



Light chain activation peptide . 

♦ ♦ Ila/Xa 

Human 1649 EITRTTLQSOQEEIOYOOTISVEHKKEDFOIYOEDEHQSPR 
Pig 0 1 SLPTRQPEE0KHOYO0 IFSTETKGEOFOIYGEDENQOPR 

House EL--SAFQSEQEATOYDOAITIET-IEDFOIYSEDIKQGPR 

* * **** * ****** ** * ** 
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A3 domain 

IXa Xa 

Human 1690 SFQKKTRHYF IAAVERLWDY GHSSSPHVLR NRAQSGSVPQ FKKVVFQEFT 

Pig SFQKRTRHYF IAAYEQLWDY GHSESPRALR NRAQNGEVPR FKKVYFRERA 

House SVQQKTRHYF IAAYERLHOY GHSTS-HVLR NRYQSDNVPQ FKKVVFQEFT 

* * ***** ***** **** *** * ** ** * ** ****** * 

1740 DGSFTQPLYR GELNEHLGLL GPYIRAEVED NIMVTFRNQA SRPYSFYSS1 

DGSFTNPSYR GELNKHLGLL GPYIRAEVED NIHYTFKNQA SRPYSFYSSL 

DGSFSQPLYR GELNEHLGLL GPYIRAEVED NIHYTFKNQA SRPYSFYSSL 
**** * ** **** ***** ********** ****** *** ********** 

Factor IXa binding 
1790 ISYEEOOROG AFPRKNFVKP NETKTYFWKV OHHHAPTKDE FDCKAWAYFS 
ISYPDOQEQG AEPRHNFVQP NETRTYFHKV QHHMAPTEDE FOCKAHAYFS 
ISYKEDQR-G EEPRRNFVKP HETKIYFWKV QHHMAPTEDE FDCKAWAYFS 

*** ** * *** *** * *** ***** ********** ********** 



1840 DVDLEKDVHS GLIGPLLVCH TNTLNPAHGR QVTVQEFALF FTIFDETKSW 
OVDLEKDVHS GLIGPLLICR ANTLNAAHGR QVTVQEFALF FTIFDETKSW 
DVDLERDHHS GLIGPLLICH ANTLNPAHGR QVSVQEFALL FTIFDETKSW 

***** * ** ******* * **** **** ** ****** ********** 



1890 YFTENHERNC RAPCNIQMED PTFKENYRFH AINGYIMDTL PGLVMAQOQR 
YFTENVERNC RAPCHLQHED PTLKENYRFH AINGYVMDTL PGLVHAQNQR 
YFTENVKRNC KTPCNFQMEO PTLKENYRFH AINGYVMDTL PGLVMAQOQR 
***** *** ** **** ** ******* ********** ******* ** 

1940 IRWYLLSMGS NENIHSIHFS GHVFTYRKKE EYKMALYNLY PGVFETVEML 
IRWYLLSMGS NENIHSIHFS GHVFSYRKKE EYKMAVYNLY PGVFETVEML 
IRWYLLSMGN NENIQSIHFS GHVFTVRKKE EYKMAVYNLY PGVFETLEMI 
********* **** ***** **** ***** ***** **** ****** ** 

Protein C binding 
1990 PSKAGIWRVE CLIGEHL HAG MSTLFLVY SN 
PSKVGIWRIE CLIGEHLQAG MSTTFLVYSK 
PSRAGIWRVE CLIGEHLQAG MSTLFLVYSK 

** **** * ******* ** *** ****** 
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CI domain 



Human 2020 KCQTPLGMAS GHIRDFQITA SGQYGQWAPK LARLHYSGSI NAWSTKFPFS 

Pig ECQAPLGMAS GRIRDFQITA SGQYGQWAPK LARLHYSGSI NAWSTKDPHS cio 

Mouse QCQIPLGHAS GSIROFQITA SGHYGQWAPN i^L^iGSr SS^kSfs RG * 1G 

*r***** * ******** ^ ****** ********** ****** * * 

2070 SKSHXE "!S!K8Si « QKFSSLYIS Q^IMYSLDG KKWQTYRGNS 

W f Jf vni i"adm Ji2rfKS? A R Q KFSSLYIS QFIIMYSLDG RhwQSYRGNS 

******«!* IVHGIKTQGA RQKFSSLYIS QFIIMYSLDG KKWLSYQGNS 

********** * *** **** ********** ********** * * *** 

2120 Srl-uwcc^ S^F" 1 FNp PHARYI RLHPTHYSIR STLRMELMGCDLN 
TS^cS VOASGIKHNI FNPPIVARYI RLHPTHYSIR STLRMELHGCDLN 
TGTLMVFFGN VOSSGIKHNS FNPPIIARYI RLHPTHSSIR STLRMELMGCDLN 
** ****** ***** **** ****** *** ************* 



C2 domain 

Human 2173 SCSMPLGMES KAIfDAQITA SSYFTNMFAT WSPSKARLHL QGRSNAWRPQ 

u l c f£f ™ ^ KAISDSQITA SSHLSNIFAT WSPSQARLHL QGRTNAWRPR cio 1U 

Mouse SCSIPLGMES KVISQTQITA SSYFTNMFAT WSPSQARLHL QGRTNAWR>5 ^ 1H 

**** * *** *** * ** * *** **** ***** *** ******* 

2223 J^accK ° F 2 KTMKVTG VTTQGVKSU TSHYVKEFu'sSSQOGHQHr 
VSSAEEWLQV OLQKTVKVTG ITTQGVKSLL SSMYVKEFLV SSSODGRRWT 
VNOPKQWLQV DLQKTMXVTG IITQGVKSLF Ts!3f^EF[i SSSQOGHHVT 

* **** * * ** **** ******** ** ***** ****** ** 

2273 m XXSS^l??^ 5 ^ VVNSLDp Pl-t- TRYLRTHP"/ WHnfflfpMF 
OM vEr?I^ cSX SSTP VVNAI -0PPLF IRYLRIHPTS HAQHIALRLE 
QILYNGKVKV FQGNQOSSTP MMNSLDPPLL TRYLRIHPQI WEHQIALRLE 
* ** ********** ***** ******** • - - - - 



binding 
2323 a2£££22LY 
VLGCEAQOLY 
ILGCEAQQQY 

****** * 



**** 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: EMORY UNIVERSITY 
<ii) TITLE OF INVENTION: Modified Factor VIII 
(iii) NUMBER OF SEQUENCES: 40 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Greenlee, Winner and Sullivan, P.C. 

(B) STREET: 5370 Manhattan Circle Suite 201 

(C) CITY: Boulder 

(D) STATE: Colorado 

(E) COUNTRY: USA 

(F) ZIP: 80303 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE : Patentln Release #1.0, Version #1.30 

CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: WO Unassigned \ 

(B) FILING DATE: 10-MAR-1999 

(C) CLASSIFICATION: 

PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/037,601 

(B) FILING DATE: 10-MAR-1998 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Greenlee, Lorance L. 

(B) REGISTRATION NUMBER: 27,894 

(C) REFERENCE /DOCKET NUMBER : 75-95G WO 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 303/499-8080 

(B) TELEFAX: 303/499-8089 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9009 base pairs 

(B) TYPE: nucleic acid 

1 
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(vi) 



(vii) 
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(C) STRANDEDNESS : double 

(D) . TOPOLOGY: not. relevant 



(ii) 



MOLECULE TYPE: cDNA to mRNA 



(iii) 



HYPOTHETICAL: NO 



(iv) 



ANTI- SENSE: NO 



(vi) 



ORIGINAL SOURCE: 
(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: Liver 



(ix) 



FEATURE : 



(A) NAME /KEY : mis cofeature 

(B) LOCATION: 5125.. 7053 

(D) OTHER INFORMATION: /prdduct= "Domain Structure" 
/note= "Equivalent to the A3-C1-C2 domain" 

(ix) FEATURE: 

(A) NAME /KEY : miscjeature 

(B) LOCATION: 1..2277 

(D) OTHER INFORMATION: /product= "Domain Structure" 
/note= "Equivalent to the A1-A2 domain" 

(ix) FEATURE: 

(A) NAME /KEY : miscjeature 

(B) LOCATION: 1..2277 

(D) OTHER INFORMATION: /product^ "Domain" 
/note= "cDNA encoding human factorVIII" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CAGTGGGTAA GTTCCTTAAA TGCTCTGCAA AGAAATTGGG ACTTTTCATT AAATCAGAAA 60 
TTTTACTTTT TTCCCCTCCT GGGAGCTAAA GATATTTTAG AGAAGAATTA ACCTTTTGCT " 12 0 

TCTCCAGTTG AACATTTGTA GCAATAAGTC ATGCAAATAG AGCTCTCCAC CTGCTTCTTT 180 

CTGTGCCTTT TGCGATTCTG CTTTAGTGCC ACCAGAAGAT ACTACCTGGG TGCAGTGGAA 240 

CTGTCATGGG ACTATATGCA AAGTGATCTC GGTGAGCTGC CTGTGGACGC AAGATTTCCT 3 00 

CCTAGAGTGC CAAAATCTTT TCCATTCAAC ACCTCAGTCG TGTACAAAAA GACTCTGTTT 360 

GTAGAATTCA CGGTTCACCT TTTCAACATC GCTAAGCCAA GGCCACCCTG GATGGGTCTG 420 

CTAGGTCCTA CCATCCAGGC TGAGGTTTAT GATACAGTGG TCATTACACT. TAAGAACATG 4 80 

GCTTCCCATC CTGTCAGTCT TCATGCTGTT GGTGTATCCT ACTGGAAAGC TTCTGAGGGA 54 0 

GCTGAATATG ATGATCAGAC CAGTCAAAGG GAGAAAGAAG ATGATAAAGT CTTCCCTGGT 6 00 

GGAAGCCATA CATATGTCTG GCAGGTCCTG AAAGAGAATG GTCCAATGGC CTCTGACCCA 66 0 



2 
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CTGTGCCTTA CCTACTCATA TCTTTCTCAT GTGGACCTGG TAAAAGACTT GAATTCAGGC 720 

CTCATTGGAG CCCTACTAGT ATGTAGAGAA GGGAGTCTGG CCAAGGAAAA GACACAGACC 780 

TTGCACAAAT TTATACTACT TTTTGCTGTA TTTGATGAAG GGAAAAGTTG GCACTCAGAA 84 0 

ACAAAGAACT CCTTGATGCA GGATAGGGAT GCTGCATCTG CTCGGGCCTG GCCTAAAATG 900 

CACACAGTCA ATGGTTATGT AAACAGGTCT CTGCCAGGTC TGATTGGATG CCACAGGAAA 960 

TCAGTCTATT GG CATGTG AT TGGAATGGGC ACCACTCCTG AAGTGCACTC AATATTCCTC 102 0 

GAAGGTCACA CATTTCTTGT GAGGAACCAT CGCCAGGCGT CCTTGGAAAT CTCGCCAATA 1080 
ACTTTCCTTA CTGCTCAAAC ACTCTTGATG GACCTTGGAC AGTTTCTACT GTTTTGTCAT ' 114 0 

ATCTCTTCCC ACCAACATGA TGGCATGGAA GCTTATGTCA AAGTAGACAG CTGTCCAGAG 12 00 

GAACCCCAAC TACGAATGAA AAATAATGAA GAAGCGGAAG ACTATGATGA TGATCTTACT 126 0 

GATTCTGAAA TGGATGTGGT CAGGTTTGAT GATGACAACT CTCCTTCCTT TATCCAAATT 132 0 

CGCTCAGTTG CCAAGAAGCA TCCTAAAACT TGGGTACATT ACATTGCTGC TGAAGAGGAG 13 8 0 

G AC TGGG ACT ATGCTCCCTT AGTCCTCGCC CCCGATGACA GAAGTTATAA AAGTCAATAT 144 0 

TTGAACAATG GCCCTCAGCG GATTGGTAGG AAGTACAAAA AAGTCCGATT TATGGCATAC 1500 

ACAGATGAAA CCTTTAAGAC TCGTGAAGCT ATTCAGCATG AATCAGGAAT CTTGGGACCT 1560 

TTACTTTATG GGGAAGTTGG AGACACACTG TTGATTATAT TTAAGAATCA AGCAAGCAGA 162 0 

CCATATAACA TCTACCCTCA CGGAATCACT GATGTCCGTC CTTTGTATTC AAGGAGATTA : 168 0 

CCAAAAGGTG TAAAACATTT GAAGGATTTT CCAATTCTGC CAGGAGAAAT ATTCAAATAT , 174 0 

AAATGGACAG TGACTGTAGA AGATGGGCCA ACTAAATCAG ATCCTCGGTG CCTGACCCGC v ' 1800 

TATTACTCTA GTTTCGTTAA TATGGAGAGA GATCTAGCTT CAGGACTCAT TGGCCCTCTC 186 0 

CTCATCTGCT ACAAAGAATC TGTAGATCAA AGAGGAAACC AGATAATGTC AGACAAGAGG 1920 

AATGTCATCC TGTTTTCTGT ATTTGATGAG AACCGAAGCT GGTACCTCAC AGAGAATATA 198 0 

CAACGCTTTC TCCCCAATCC AGCTGGAGTG CAGCTTGAGG ATCCAGAGTT CCAAGCCTCC 2 04 0 

AACATCATGC ACAGCATCAA TGGCTATGTT TTTGATAGTT TGCAGTTGTC AGTTTGTTTG 2100 

CATGAGGTGG CATACTGGTA CATTCTAAGC ATTGGAGCAC AGACTGACTT CCTTTCTGTC 2160 

TTCTTCTCTG GATATACCTT CAAACACAAA ATGGTCTATG AAGACACACT CACCCTATTC 2220 

CCATTCTCAG GAGAAACTGT CTTCATGTCG ATGGAAAACC CAGGTCTATG G ATT CTGGGG 2 280 
TGCCACAACT • CAGACTTTCG GAACAGAGGC ATGACCGCCT TACTGAAGGT TT CTAGTTGT 234 0 
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GACAAGAACA 


CTGGTGATTA 


TTACGAGGAC 


AGTTATGAAG 


ATATTTCAGC 


ATACTTGCTG 


2400 


AGTAAAAACA 


ATGCCATTGA 


ACCAAGAAGC 


TTCTCCCAGA 


ATTCAAGACA 


CCCTAGCACT 


2460 


AGGCAAAAGC 


AATTTAATGC 


CACCACAATT 


C C AG AAAATG 


ACATAGAGAA 


GACTGACCCT 


2520 


TGGTTTGCAC 


ACAGAACACC 


TATGC CTAAA 


ATACAAAATG 


TCTCCTCTAG 


TGATTTGTTG 


2580 


ATGCTCTTGC 


GACAGAGTCC 


TACTCCACAT 


GGGCTATCCT 


TATC TGATCT 


CCAAGAAGCC 


2640 


AAATATGAGA 


CTTTTTCTGA 


TGATCCATCA 


CCTGGAGCAA 


TAGACAGTAA 


TAACAGCCTG 


2700 


TCTGAAATGA 


CACACTTCAG 


GCCACAGCTC 


CATCACAGTG 


GGGACATGGT 


ATTTACC CCT 


2760 


GAGTCAGGCC 


TCCAATTAAG 


ATTAAATGAG 


AAACTGGGGA 


CAACTGCAGC 


AACAGAGTTG 


2820 


AAGAAACTTG 


ATTTCAAAGT 


TTCTAGTACA 


TCAAATAATC 


TGATTTCAAC 


AATTCCATCA 


2880 


GACAATTTGG 


CAGCAGGTAC 


TGATAATACA 


AGTTCCTTAG 


GACCCCCAAG 


TATGCCAGTT 


2940 


CATTATGATA 


GTCAATTAGA 


TACCACTCTA 


TTTGGCAAAA 


AGTCATCTCC 


CCTTACTGAG 


3000 


TC TGGTGG AC 


CTCTGAGCTT 


GAGTGAAGAA 


AATAATGATT 


CAAAGTTGTT 


AGAATCAGGT 


3060 


TTAATGAATA 


GCCAAGAAAG 


TTCATGGGGA 


AAAAATGTAT 


CGTCAACAGA 


GAGTGGTAGG 


3X20 


TTATTTAAAG 


GGAAAAGAGC 


TCATGGACCT 


GCTTTGTTGA 


CTAAAGATAA 


TGCCTTATTC 


3180 


AAAGTTAGCA 


TCTCTTTGTT 


AAAGACAAAC 


AAAACTTCGA 


ATAATTCAGC 


AACTAATAGA 


3240 


AAGACTCACA 


TTGATGGCCC 


ATCATTATTA 


ATTGAGAATA 


GTCCATCAGT 


CTGGCAAAAT 


3300 


ATATTAGAAA 


GTGACACTGA 


GTTTAAAAAA 


GTGACACCTT 


TGATTCATGA 


CAGAATGCTT 


3360 


ATGGACAAAA 


ATGCTACAGC 


TTTGAGGCTA 


AAT CAT ATGT 


CAAATAAAAC 


TACTTCATCA 


3420 


AAAAACATGG 


AAATGGTCCA 


ACAGAAAAAA 


GAGGGCCCCA 


TTCCACCAGA 


TGCACAAAAT 


3480 


CCAGATATGT 


CGTTCTTTAA 


GATGCTATTC 


TTGCCAGAAT 


CAGCAAGGTG 


GATACAAAGG 


3540 


ACTCATGGAA 


AGAACTCTCT 


GAACTCTGGG 


CAAGGCCCCA 


GTCCAAAGCA 


ATTAGTATCC 


3600 


TTAGGACCAG 


AAAAATCTGT 


GGAAGGTCAG 


AATTTCTTGT 


CTGAGAAAAA 


CAAAGTGGTA 


3660 


GTAGGAAAGG 


GTGAATTTAC 


AAAGGACGTA 


GGACTCAAAG 


AGATGGTTTT 


TCCAAGCAGC 


3720 


AGAAACCTAT 


TTCTTACTAA 


CTTGGATAAT 


TTACATGAAA 


ATAATACACA 


CAATCAAGAA 


3780 


AAAAAAATTC 


AGGAAGAAAT 


AGAAAAGAAG 


GAAACATTAA 


TCCAAGAGAA 


TGTAGTTTTG 


3840 


CCTCAGATAC 


ATACAGTGAC 


TGGCACTAAG 


AATTTCATGA AGAACCTTTT 


CTTACTGAGC 


3900 


ACTAGGCAAA 


ATGTAGAAGG 


TTCATATGAG 


GGGGCATATG 


CTCCAGTACT 


TCAAGATTTT 


3960 


AGGTCATTAA 


ATGATTCAAC 


AAATAGAACA 


AAGAAACACA 


CAG CTCATTT 


CTCAAAAAAA 


4020 
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GGGGAGGAAG AAAACTTGGA AGG CTTGGG A AATCAAACCA AGCAAATTGT AGAGAAATAT 4080 

GCATGCACCA CAAGGATATC TCCTAATACA AGCCAGCAGA ATTTTGTCAC GCAACGTAGT 414 0 

AAGAGAG CTT TGAAACAATT CAGACTCCCA CTAGAAGAAA CAGAACTTGA AAAAAGGATA 420 0 

ATTGTGGATG ACACCTCAAC CCAGTGGTCC AAAAACATGA AACATTTGAC CCCGAGCACC 4260 

CTCACACAGA TAGACTACAA TGAGAAGGAG AAAGGGGCCA TTACTCAGTC TCCCTTATCA 432 0 

GATTGCCTTA CGAGGAGTCA TAGCATC CCT CAAGCAAATA GATCTCCATT ACCCATTGCA 438 0 

AAGGTATCAT CATTTCCATC TATTAGACCT ATATATCTGA CCAGGGTCCT ATTCCAAGAC 444 0 

AACTCTTCTC ATCTTCCAGC AGCATCTTAT AGAAAGAAAG ATT CTGGGGT CCAAGAAAGC 4 500 

AGTCATTTCT TACAAGGAGC CAAAAAAAAT AACCTTTCTT TAG C C ATT CT AAC CTTGG AG 4 56 0 

ATGACTGGTG ATCAAAGAGA GGTTGGCTCC CTGGGGACAA GTG CCACAAA TTCAGTCACA 462 0 

TACAAGAAAG TTGAGAACAC TGTTCTCCCG AAACCAGACT TGCCCAAAAC ATCTGGCAAA 46 8 0 

GTTGAATTGC TTC CAAAAGT TCACATTTAT CAGAAGGACC TATTCC CTAC GGAAACTAGC 4 74 0 

AATGGGTCTC CTGGCCATCT GGATCTCGTG GAAGGGAGCC TTCTTCAGGG AACAGAGGGA 4 800 

GCGATTAAGT GGAATGAAGC AAACAGACCT GGAAAAGTTC C CTTTCTG AG AGTAGCAACA 4 86 0 

GAAAGCTCTG CAAAGACTCC CTCCAAGCTA TTGGATCCTC TTGCTTGGGA T AAC CACTAT 4 92 0 

GGTACTCAGA TACCAAAAGA AGAGTGGAAA TCCCAAGAGA AGTCAC CAGA AAAAACAGCT 4 98 0 

TTTAAGAAAA AGGATACCAT TTTGTCCCTG AACGCTTGTG AAAGCAATCA TGCAATAGCA .504 0 

GCAATAAATG AGGGACAAAA TAAGCCCGAA ATAGAAGTCA CCTGGGCAAA GCAAGGTAGG 5100 

ACTGAAAGGC TGTGCTCTCA AAACCCACCA GTCTTGAAAC GCCATCAACG GGAAATAACT 5160 

GGTACTACTC TTCAGTCAGA TCAAGAGGAA ATTGACTATG ATGATACCAT ATCAGTTGAA 522 0 

ATGAAGAAGG AAGATTTTGA CATTTATGAT GAGGATGAAA ATCAGAGCCC CCGCAGCTTT 5280 

CAAAAGAAAA CACGACACTA TTTTATTGCT GCAGTGGAGA GGCTCTGGGA TTATGGGATG 5340 

AGTAGCTCCC CACATGTTCT AAGAAACAGG GCTCAGAGTG GCAGTGTCCC TCAGTTCAAG 5400 

AAAGTTGTTT TCCAGGAATT TACTGATGGC TCCTTTACTC AGCCCTTATA CCGTGGAGAA. 546 0 

CTAAATGAAC ATTTGGGACT CCTGGGGCCA TATATAAGAG CAGAAGTTGA AGATAATATC 5520 
ATGGTAACTT TCAGAAATCA GGCCTCTCGT CCCTATTCCT TCTATTCTAG CCTTATTTCT 558 0 

TATGAGGAAG ATCAGAGGCA AGGAGCAGAA CCTAGAAAAA ACTTTGTCAA GCCTAATGAA 5640 
AC CAAAACTT ACTTTTGGAA AGTGCAACAT CATATGGCAC CCACTAAAGA TGAGTTTGAC 5700 
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TGCAAAGCCT 


GGGCTTATTT 


CTCTGATGTT 


GACCTGGAAA 


. AAGATGTGGA 


CTCAGGCCTG 


5760 


' ATTGGACCCC 


TTCTGGTCTG 


CCACACTAAC 


ACACTGAACC 


CTGCTCATGG 


GAGACAAGTG 


5820 


ACAGTACAGG 


AATTTGCTCT 


GTTTTTCACC 


ATCTTTGATG 


AGACCAAAAG 


CTGGTACTTC 


5880 


.. ACTGAAAATA 


TGGAAAGAAA 


CTGCAGGGCT 


CCCTGCAATA 


TCCAGATGGA 


AGATCCCACT 


5940 


TTTAAAGAGA 


ATTATCGCTT 


CCATGCAATC 


AATGGCTACA 


TAATGGATAC 


ACTACCTGGC 


6000 


TTAGTAATGG 


CTCAGGATCA 


AAGGATTCGA 


TGGTATCTGC 


TCAGCATGGG 


CAGCAATGAA 


6060 


AACATCCATT 


CTATTCATTT 


CAGTGGACAT 


GTGTTCACTG 


TACGAAAAAA 


AGAGGAGTAT 


6120 


AAAATGGCAC 


TGTACAATCT 


CTATCCAGGT 


GTTTTTGAGA 


CAGTGGAAAT 


GTTACCATCC 


6180 


AAAG CTGGAA 


TTTGGCGGGT 


GGAATGCCTT 


ATTGGCGAGC 


ATCTACATGC 


TGGGATGAGC 


6240 


ACACTTTTTC 


TGGTGTACAG 


CAATAAGTGT 


CAGACTCCCC 


TGGGAATGGC 


TTCTGGACAC 


6300 


ATTAGAGATT 


TTCAGATTAC 


AGCTTCAGGA 


CAATATGGAC 


AGTGGGCCCC 


AAAGCTGGCC 


6360 


AGACTTCATT 


ATTCCGGATC 


AATCAATGCC 


TGGAGCACCA 


AGGAGCCCTT 


TTCTTGGATC 


6420 


AAGGTGGATC 


TGTTGGCACC 


AATGATTATT 


CACGGCATCA 


AGACCCAGGG 


TGCCCGTCAG 


6480 


AAGTTCTCCA 


GCCTCTACAT 


CTCTCAGTTT 


ATCATCATGT 


ATAGTCTTGA 


TGGGAAGAAG 


6540 


TGGCAGACTT 


ATCGAGGAAA 


TTCCACTGGA 


ACCTTAATGG 


TCTTCTTTGG 


CAATGTGGAT 


6600 


TCATCTGGGA 


TAAAACACAA 


TATTTTTAAC 


CCTCCAATTA 


TTGCTCGATA 


CATCCGTTTG 


6660 


CACCCAACTC 


ATTATAGCAT 


TCGCAGCACT 


CTTCGCATGG 


AGTTGATGGG 


CTGTGATTTA 


6720 


AATAGTTGCA 


GCATGCCATT 


GGGAATGGAG 


AGTAAAGCAA 


TATCAGATGC 


ACAGATTACT 


6780 


GCTTCATGCT 


ACTTTAGCAA 


TATGTTTGCC 


ACCTGGTCTC 


CTTCAAAAGC 


TCGACTTCAC 


6840 


CTCCAAGGGA 


GGAGTAATGC 


CTGGAGACCT 


CAGGTGAATA 


ATCCAAAAGA 


GTGGCTGCAA 


6900 


GTGGACTTCC 


AGAAGACAAT 


GAAAGTCACA 


GGAGTAACTA 


CTCAGGGAGT 


AAAATCTCTG 


6960 


CTTACCAGCA 


TGTATGTGAA 


GGAGTTCCTC 


ATCTCCAGCA 


GTCAAGATGG 


CGATCAGTGG 


7020 


ACTCTCTTTT 


TTCAGAATGG 


CAAAGTAAAG 


GTTTTTCAGG 


GAAATCAAGA 


CTCCTTCACA 


7080 


CCTGTGGTGA 


ACTCTCTAGA 


CCCACCGTTA 


CTGACTCGCT 


ACCTTCGAAT 


TCACCCCCAG 


7140 


AGTTGGGTGC 


ACCAGATTGC 


CCTGAGGATG 


GAGGTTCTGG 


GCTGCGAGGC 


ACAGGACCTC 


7200 


TACTGAGGGT 


GGCCACTGCA 


GCACCTGCCA 


CTGCCGTCAC 


CTCTCCCTCC 


TCAGCTCCAG 


7260 


GGCAGTGTCC CTCCCTGGCT 


TGCCTTCTAC 


CTTTGTGCTA 


AATCCTAGCA 


GACACTGCCT 


7320 


TGAAGCCTCC 


TGAATTAACT 


ATCATCAGTC 


CTGCATTTCT 


TTGGTGGGGG 


GCCAGGAGGG 


7380 
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TGCATCCAAT TTAACTTAAC TCTTACCTAT TTTCTGCAGC TGCTCCCAGA TTACTCCTTC 744 0 

CTTCCAATAT AACTAGGCAA AAAGAAGTGA GGAGAAACCT GCATGAAAGC ATTCTTCCCT 7 500 

GAAAAGTTAG GCCTCTCAGA GTCACCACTT CCTCTGTTGT AGAAAAACTA TGTGATGAAA 756 0 

CTTTGAAAAA GATATTTATG ATGTTAACAT TTCAGGTTAA GCCTCATACG TTTAAAATAA 762 0 

AACTCTCAGT TGTTTATTAT CCTGATCAAG CATGGAACAA AGCATGTTTC AGGATCAGAT 7680 

CAATACAATC TTGGAGTCAA AAGGCAAATC ATTTGGACAA TCTGCAAAAT GGAGAGAATA 774 0 

CAATAACTAC TACAGTAAAG TCTGTTTCTG CTTCCTTACA CATAGATATA ATTATGTTAT 78 00 

TTAGTCATTA TGAGGGGCAC ATTCTTATCT CCAAAACTAG CATTCTTAAA CTGAGAATTA 7 86 0 

TAGATGGGGT TCAAGAATCC CTAAGTCCCC TGAAATTATA TAAGGCATTC TGTATAAATG 7 92 0 

CAAATGTGCA TTTTTCTGAC GAGTGTCCAT AGATATAAAG CCATTGGTCT TAATTCTGAC 7 98 0 

CAATAAAAAA ATAAGTCAGG AGGATGCAAT TGTTGAAAGC TTTGAAATAA AATAACATGT 8 04 0 

CTTCTTGAAA TTTGTGATGG CCAAGAAAGA AAATGATGAT GACATTAGGC TTCTAAAGGA 8100 

CATACATTTA ATATTTCTGT GGAAATATGA GGAAAATCCA TGGTTATCTG AGATAGGAGA 816 0 

-TACAAACTTT GTAATTCTAA TAATGC ACT C AGTTTACTCT CTCCCTCTAC TAATTTCCTG 822 0 

.7" CTGAAAATAA CACAACAAAA ATGTAACAGG GGAAATTATA TACCGTGACT GAAAACTAGA 82 80 

; GTCCTACTTA CATAGTTGAA ATATCAAGGA GGTCAGAAGA AAATTGGACT GGTGAAAACA 834 0 
.; GAAAAAACAC TCCAGTCTGC CATATCACCA CACAATAGGA TCCCCCTTCT TGCCCTCCAC - 84 00 

CCCCATAAGA TTGTGAAGGG TTTACTGCTC CTTCCATCTG CCTGCACCCC TTCACTATGA 8460 
CTACACAGAA CTCTCCTGAT AGTAAAGGGG GCTGGAGGCA AGGATAAGTT ATAGAGCAGT -y 8 52 0 

. TGGAGGAAGC ATCCAAAGAC TGCAACCCAG GGCAAATGGA AAACAGGAGA TCCTAATATG 8 58 0 

AAAGAAAAAT GGATCCCAAT CTGAGAAAAG GCAAAAGAAT GGCTACTTTT TTCTATGCTG 8640 

GAGTATTTTC TAATAATCCT GCTTGACCCT TATCTG AC CT CTTTGGAAAC TATAACATAG 8 7 00 

CTGTCACAGT ATAGTCACAA TCCACAAATG ATGCAGGTGC AAATGGTTTA TAGCCCTGTG 8 76 0 

AAGTTCTTAA AGTTTAGAGG CTAACTTACA GAAATGAATA AGTTGTTTTG TTTTATAGCC 8 82 0 

, CGGTAGAGGA GTTAACCCCA AAGGTGATAT GGTTTTATTT CCTGTTATGT TTAACTTGAT 888 0 

AATCTTATTT TGGCATTCTT TTCCCATTGA CTATATACAT CTCTATTTCT CAAATGTTCA 8 94 0 

TGGAACTAGC TCTTTTATTT TCCTGCTGGT TTCTTCAGTA ATGAGTTAAA TAAAACATTG 9000 

ACACATACA . " / . • ( 9009 
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( 2 ) INFORMATION FOR ' SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2332 amino acids 

(B) -TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY not relevant 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

;(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE; N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: Liver 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser Trp Asp Tyr 
1 5 10 -15 

Met Gin Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg Phe Pro Pro 
20 25 30 

Arg Val Pro Lys- Ser Phe Pro Phe Asn Thr Ser Val Val Tyr Lys Lys 
35 40 45 

Thr Leu Phe Val Glu Phe Thr Val His Leu Phe Asn lie Ala Lys Pro 
50 55 60 

Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr lie Gin Ala Glu Val 
65 70 75 80 

Tyr Asp Thr Val Val lie Thr Leu Lys Asn Met Ala Ser His Pro Val 
85 90 95 

Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser Glu Gly Ala 
100 105 no 

Glu Tyr Asp Asp Gin Thr Ser Gin Arg Glu Lys Glu Asp Asp Lys Val 
115 120 125 

Phe Pro Gly * Gly Ser His Thr Tyr Val Trp Gin - Val Leu - Lys Glu Asn 
130 135 140 

Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser Tyr Leu Ser 
145 150 ' 155 160 

His Val' Asp Leu Val Lys Asp Leu Asn Ser Gly Leu lie Gly Ala Leu 
165 170 175 
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Leu Val Cys Arg Glu Gly Ser Leu Ala Lys GIu Lys Thr Gin Thr Leu 
180 185 190 

His Lys Phe lie Leu Leu Phe Ala Val Phe Asp Glu Gly Lys Ser Trp 
195 200 205 

His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp Ala Ala Ser 
210 215 220 

Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr Val Asn Arg 
225 230 235 240 

Ser Leu Pro Gly Leu lie Gly cys His Arg Lys Ser Val Tyr Trp His 
245 250 255 

Val lie Gly Met Gly Thr Thr Pro Glu Val His Ser lie Phe Leu Glu 
260 265 270 

Gly His Thr Phe Leu Val Arg Asn His Arg Gin Ala Ser Leu Glu lie 
275 280 285 

Ser Pro lie Thr Phe Leu Thr Ala Gin Thr Leu Leu Met Asp Leu Gly 
290 295 300 

Gin Phe Leu Leu Phe. Cys His lie Ser Ser His Gin His Asp Gly Met 
305 310 315 320 

Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro Gin Leu Arg 
325 330 335 

Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp Leu Thr Asp 
340 345 350 

Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser Pro Ser Phe 
355 360 365 

- lie Gin lie Arg Ser Val Ala Lys Lys His Pro Lys Thr Trp Val His 
370 375 380 

Tyr lie Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro Leu Val Leu 
385 390 395 400 

Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gin Tyr Leu Asn Asn Gly Pro 
405 - 410 415 

Gin Arg lie Gly Arg Lys Tyr Lys Lys Val Arg Phe Met Ala Tyr Thr 
420 425 430 

Asp Glu Thr Phe Lys Thr Arg Glu Ala lie Gin His Glu Ser Gly lie 
435 440 445 

Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu Leu lie lie 
450 455 460 

Phe Lys, Asn Gin Ala Ser Arg Pro Tyr Asn lie Tyr Pro His Gly He 
465 470 475 480 
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Thr Asp Val Arg Pro Leu Tyr Ser Arg - Arg' Leu" Pro - Lys Gly Val Lys 
485 490 495 

His Leu Lys Asp Phe Pro lie Leu Pro Gly Glu lie Phe Lys Tyr Lys 
500 505 510 

Trp Thr Val Thr Val • Glu Asp Gly Pro Thr Lys S e r Asp Pro Arg Cys 
515 520 525 

Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg Asp Leu Ala ' 
530 535 540 

Ser Gly Leu lie Gly Pro Leu Leu lie Cys Tyr Lys Glu Ser Val Asp 
545 550 555 560 

Gin Arg Gly Asn Gin lie Met Ser Asp Lys Arg Asn Val lie Leu Phe 
565 570 575 

Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu Asn lie Gin 
580 585 590 

Arg Phe Leu Pro Asn Pro Ala Gly Val Gin Leu Glu Asp Pro Glu Phe 
595 600 605 

Gin Ala Ser Asn lie Met His Ser lie Asn Gly Tyr Val Phe Asp Ser 
610 615 620 

Leu Gin Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp Tyr lie Leu 
625 630 635 640 

• Ser lie Gly Ala Gin Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr 

645 650 655 

Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro 
660 665 670 

Phe Ser Gly Glu Thr Val Phe Met Ser Met. Glu Asn Pro Gly Leu Trp 
675 680 685 

lie Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly Met Thr Ala 
690 695 700 

Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp Tyr Tyr Glu 
705 710 715 720 

Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu Ser Lys Asn Asn Ala 
725 730 735 

lie Glu Pro Arg Ser Phe Ser Gin Asn Ser Arg His Pro Ser Thr Arg 
740 745 750 

Gin Lys Gin Phe Asn Ala Thr Thr lie Pro Glu Asn Asp lie Glu Lys 
755 760 765 

Thr Asp Pro Trp Phe Ala His Arg Thr Pro Met Pro Lys lie Gin Asn 
770 775 780 
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Val Ser Ser Ser Asp Leu Leu Met Leu Leu Arg Gin Ser Pro Thr Pro 
785 . 790 795 800 

His Gly Leu Ser Leu Ser Asp Leu Gin Glu Ala Lys Tyr Glu Thr Phe 
805 810 815 

Ser Asp Asp Pro Ser Pro Gly Ala lie Asp Ser Asn Asn Ser Leu Ser 
820 825 830 

Glu Met Thr His Phe Arg Pro Gin Leu His His Ser Gly Asp Met Val 
835 840 845 

Phe Thr Pro Glu Ser Gly Leu Gin Leu Arg Leu Asn Glu Lys Leu Gly 
850 855 860 

Thr Thr Ala Ala Thr Glu Leu Lys Lys Leu Asp Phe Lys Val Ser Ser 
865 870 875 880 

Thr Ser Asn Asn Leu lie Ser Thr lie Pro Ser Asp Asn Leu Ala Ala 
885 890 895 

Gly Thr Asp Asn Thr Ser Ser Leu Gly Pro Pro Ser Met Pro Val His 
900 905 910 

Tyr Asp Ser Gin Leu Asp Thr Thr Leu Phe Gly Lys Lys Ser Ser Pro 
915 920 925 

Leu Thr Glu Ser Gly Gly Pro Leu Ser Leu Ser Glu Glu Asn Asn Asp 
930 935 940 

Ser Lys Leu Leu Glu Ser Gly Leu Met Asn Ser Gin Glu Ser Ser Trp 
945 950 955 960 

Gly Lys Asn Val Ser Ser Thr Glu Ser Gly Arg Leu Phe Lys Gly Lys 
965 970 975 

Arg Ala His Gly Pro Ala Leu Leu Thr Lys Asp Asn . Ala Leu Phe Lys 
980 985 990 

Val Ser lie Ser Leu Leu Lys Thr Asn Lys Thr Ser Asn Asn Ser Ala 
995 1000 1005 

Thr Asn Arg Lys Thr His lie Asp Gly Pro Ser Leu Leu He Glu Asn 
1010 1015 1020 

Ser Pro Ser Val Trp Gin Asn He Leu Glu Ser Asp Thr Glu Phe Lys 
1025 1030 1035 1040 

Lys Val Thr Pro Leu He His Asp Arg Met Leu Met -Asp Lys Asn Ala 
1045 . 1050 1055 

Thr Ala Leu Arg Leu Asn His Met Ser Asn Lys Thr Thr Ser Ser Lys 
1060 1065 1070 

Asn Met Glu Met Val Gin Gin Lys Lys Glu Gly Pro He Pro Pro Asp 
1075 1080 1085 
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Ala Gin Asn Pro Asp Met Ser Phe Phe Lys Met Leu Phe Leu Pro Glu 
1090 1095 1100 

Ser Ala Arg Trp He Gin Arg Thr His Gly Lys Asn Ser Leu Asn Ser 

1110 ins H20 

Gly Gin Gly Pro Ser Pro Lys Gin Leu Val Ser Leu Gly Pro Glu Lys 
1125 1130 H35 

Ser Val Glu Gly Gin Asn Phe Leu Ser Glu Lys Asn Lys Val Val Val 
1140 H45 iiso 

Gly Lys Gly Glu Phe Thr. Lys Asp Val Gly Leu Lys Glu Met Val Phe 
1155 1160 H65 

Pro Ser Ser Arg Asn Leu Phe Leu Thr Asn Leu Asp Asn Leu His Glu 
1170 1175 H80 

Asn Asn Thr His Asn Gin Glu Lys Lys He Gin Glu Glu lie Glu Lys 
1185 1190 H95 1200 

Xys Glu Thr Leu He Gin Glu Asn Val Val Leu. Pro Gin He His Thr 
1205 1210 1215 

Val Thr Gly Thr Lys Asn Phe Met Lys Asn Leu Phe Leu Leu Ser Thr 
1220 1225 1230 

Arg Gin Asn Val Glu Gly Ser Tyr Glu Gly Ala Tyr Ala Pro Val Leu 
1235 1240 1245 

Gin Asp Phe Arg Ser Leu Asn Asp Ser Thr Asn Arg Thr Lys Lys His 
1250 1255 1260 

Thr Ala His Phe Ser Lys Lys Gly Glu Glu Glu Asn Leu Glu Gly Leu 
1265 1270 1275 1280 

Gly Asn Gin Thr Lys Gin He Val Glu Lys Tyr Ala Cys Thr- Thr Arg 
1285 1290 1295 

lie Ser. Pro < Asn Thr Ser Gin Gin Asn Phe Val Thr Gin Arg Ser Lys 
1300 1305 1310 

Arg Ala Leu Lys Gin Phe Arg Leu Pro Leu Glu Glu Thr Glu Leu Glu 
1315 1320 1325 

Lys Arg He He Val Asp Asp Thr Ser Thr Gin Trp Ser. Lys Asn Met 
1330 1335 1340 

Lys- His . Leu. Thr. Pro Ser ; Thr .Leu „ Thr Gin He Asp Tyr ;Asn: Glu Lys 
1345 1350 1355 1360 

- Glu Lys Gly Ala He Thr Gin Ser Pro Leu Ser Asp Cys Leu Thr Arg 
1365 1370 .1375 

Ser- His- Ser He Pro Gin Ala Asn . Arg Ser Pro Leu Pro He. Ala Lys 
1380 1385 1390 
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Val Ser Ser Phe Pro Ser He Arg Pro He Tyr Leu Thr Arg Val Leu 
1395 1400 1405 

Phe Gin Asp Asn Ser Ser His Leu Pro Ala Ala Ser Tyr Arg Lys Lys 
1410 1415 1420 

Asp Ser Gly Val Gin Glu Ser Ser His Phe Leu Gin Gly Ala Lys Lys 
1425 1430 1435 1440 

Asn Asn Leu Ser Leu Ala He Leu Thr Leu Glu Met Thr Gly Asp Gin 
1445 1450 1455 

Arg Glu Val Gly Ser Leu Gly Thr Ser Ala Thr Asn Ser Val Thr Tyr 
1460 1465 1470 

Lys Lys Val Glu Asn Thr Val Leu Pro Lys Pro Asp Leu Pro Lys Thr 
1475 1480 1485 

Ser Gly Lys Val Glu Leu Leu Pro Lys Val His He Tyr Gin Lys Asp' 
1490 1495 1500 

Leu Phe Pro Thr Glu Thr Ser Asn Gly Ser Pro Gly His Leu Asp Leu 
1505 1510 1515 1520 

Val Glu Gly Ser Leu Leu Gin Gly Thr Glu Gly Ala He Lys Trp Asn 
1525 1530 . 1535 

Glu Ala Asn Arg Pro Gly Lys Val Pro Phe Leu Arg Val Ala Thr Glu 
1540 1545 1550 

Ser Ser Ala Lys Thr Pro Ser Lys Leu Leu Asp Pro Leu Ala Trp Asp 
1555 1560 1565 

Asn His Tyr Gly Thr Gin He Pro Lys Glu Glu Trp Lys Ser Gin Glu 
X570 1575 1580 

Lys Ser Pro Glu Lys Thr Ala Phe Lys Lys Lys Asp Thr He Leu Ser , 
1585 1590 1595 1600 

Leu Asn Ala Cys Glu Ser Asn His Ala He Ala Ala He Asn Glu Gly 
1605 1610 1615 

Gin Asn Lys Pro Glu lie Glu Val Thr Trp Ala Lys Gin Gly. Arg Thr 
1620 1625 1630 

Glu Arg Leu Cys Ser Gin Asn Pro Pro Val Leu Lys Arg His Gin Arg 
1635 1640 1645 

■ Glu lie Thr Arg Thr Thr Leu Gin Ser Asp , Gin Glu Glu He Asp Tyr 
1650 1655 1660 

-Asp Asp Thr -He Ser Val Glu Met Lys Lys Glu Asp' Phe Asp He Tyr 
1665 1670 1675 1680 

Asp Glu Asp Glu Asn Gin Ser Pro Arg Ser Phe Gin Lys Lys Thr Arg 
1685 1690 1695 
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His Tyr Phe He Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly Met Ser 
1700 1705 1710 

Ser Ser Pro His Val Leu Arg Asn Arg Ala . Gin Ser Gly Ser Val Pro 
1715 1720 1725 

Gin Phe Lys Lys Val Val Phe Gin Glu Phe Thr Asp Gly Ser: Phe Thr 
1730 1735 1740 

Gin Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly Leu Leu Gly 
1745 1750 1755 1760 

Pro Tyr He Arg Ala Glu Val Glu Asp Asn lie Met Val Thr Phe Arg 
1765 1770 1775 

Asn Gin Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu He Ser Tyr 
1780 1785 1790 

Glu. Glu Asp Gin Arg Gin Gly, Ala Glu Pro Arg Lys Asn Phe Val Lys 
1795 1800 1805 

Pro Asn Glu Thr Lys Tfctr Tyr Phe Trp Lys Val Gin His- His Met Ala 
1810 1815 1820 

Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser Asp 
1825 1830 1835 1840 

Val Asp Leu Glu Lys Asp Val His Ser Gly Leu He Gly Pro Leu Leu 
1845 1850 1855 

Val Cys His Thr Asn Thr. Leu Asn Pro Ala His Gly Arg Gin Val Thr 
1860 1865 1870 

Val Gin Glu Phe Ala Leu Phe Phe ;Thr Jle Phe Asp Glu Thr Lys Ser 
1875 1880 1885 

Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro Cys Asn 
1890 1895 1900 

. lie Gin Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe His Ala 
1905 1910 1915 1920 

- lie Asn Gly Tyr He Met Asp Thr Leu Pro Gly Leu Val Met Ala Gin 

1925 1930 1935 

Asp Gin Arg He Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn Glu Asn 
1940 1945 1950 

■: r Ile..His Ser, lie His Phe Ser Gly, His Val - Phe. Thr Val-ArgLys Lys 
1955 1960 1965 

Glu Glu , Tyr Lys Met Ala; Leu Tyr Asn Leu Tyr Pro Gly Val ■ Phe Glu 
1970 1975 1980 

, Thr Val. Glu, Met Leu Pro Ser Lys. Ala Gly lie Trp Arg Val; Glu Cys 
1985 1990 . - 1995 2000 
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Leu lie Gly Glu His Leu His Ala Gly Met Ser Thr Leu Phe Leu Val 
2005 2010 2015 

Tyr Ser Asn Lys Cys Gin Thr Pro Leu Gly Met Ala Ser Gly His lie 
2020 2025 2030 

Arg Asp Phe Gin He Thr Ala Ser Gly Gin Tyr Gly Gin Trp Ala Pro 
2035 2040 2045 

Lys Leu Ala Arg Leu His Tyr Ser Gly Ser He Asn Ala Trp Ser Thr 
2050 2055 2060 

Lys Glu Pro Phe Ser Trp He Lys Val Asp Leu Leu Ala Pro Met He 
2065 2070 2075 2080 

He His Gly He Lys Thr Gin Gly Ala Arg Gin Lys Phe Ser Ser Leu 
2085 2090 2095 

Tyr He Ser Gin Phe He He Met Tyr Ser Leu Asp Gly Lys Lys Trp 
2100 2105 2110 

Gin Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe Phe Gly 
2115 2120 2125 

Asn Val Asp Ser Ser Gly He Lys His Asn He Phe Asn Pro Pro He 
2130 2135 2140 

He Ala Arg Tyr He Arg Leu His Pro Thr His Tyr Ser He Arg Ser, 
2145 2150 2155 2160 

Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys Ser Met 
2165 2170 2175 

• : - Pro Leu Gly Met Glu Ser Lys Ala He Ser Asp Ala Gin He Thr Ala. 

2180 2185 2190 

• Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro Ser Lys Ala*v 
2195 2200 2205 

Arg Leu His Leu Gin Gly Arg Ser Asn Ala Trp Arg Pro Gin Val Asn 
2210 2215 2220 

Asn Pro Lys Glu Trp Leu Gin Val Asp Phe Gin Lys Thr Met Lys Val 
2225 2230 2235 2240 

Thr Gly Val Thr Thr Gin Gly Val Lys Ser Leu Leu Thr Ser Met Tyr 
2245 2250 2255 

Val Lys Glu Phe Leu He Ser Ser Ser Gin Asp Gly His Gin Trp Thr 
2260 2265 2270 

Leu Phe Phe Gin Asn Gly Lys Val Lys Val Phe Gin Gly Asn Gin Asp 
2275 2280 2285 

- Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg 
2290 2295 2300 
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Tyr Leu Arg : lie -His Pro Gin Ser Trp Val His Gin lie Ala Leu Arg 
2305 2310 2315 2320 

Met Glu Val Leu Gly Cys Glu Ala Gin Asp Leu Tyr 
■ ' . 2325 2330 

(2) INFORMATION FOR SEQ ID NO : 3 : — 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 0 base pairs 
■(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not. relevant 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porcine 
(F) TISSUE TYPE: blood 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 1; .1130 

(D) OTHER INFORMATION: /product^ "region" 
/note= "cDNA encoding A2 domain of porcine factorVIII" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TAAGCACCCT AAGACGTGGG TGCACTACAT . CTCTGCAGAG GAGGAGGACT GGGACTACGC 6 0 

CCCCGCGGTC CCCAGCCCCA GTGACAGAAG TTATAAAAGT CTCTACTTGA ACAGTGGTCC 12 0 

TCAGCGAATT GGTAGGAAAT ACAAAAAAGC TCGATTCGTC GCTTACACGG ATGTAACATT 18 0 

TAAGACTCGT AAAGCTATTC CGTATGAATC AGGAATCCTG GGACCTTTAC TTTATGGAGA 24 0 

AGTTGGAGAC ACACTTTTGA TTATATTTAA GAATAAAGCG AGCCGACCAT ATAACATCTA 300 

CCCTCATGGA ATCACTGATG TCAGCGCTTT GCACCCAGGG AGACTTCTAA AAGGTTGGAA 36 0 

ACATTTGAAA GACATGCCAA TTCTGCCAGG AGAGACTTTC AAGTATAAAT GGACAGTGAC 42 0 

TGTGGAAGAT GGGCCAACCA AGTCCGATCC TCGGTGCCTG ACCCGCTACT ACTCGAGCTC 48 0 

CATTAATCTA GAGAAAGATC TGGCTTCGGG ACTCATTGGC CCTCTCCTCA TCTGCTACAA 54 0 

AG AAT CTGTA GACCAAAGAG GAAACCAGAT. GATGTCAGAC AAGAGAAACG TCATCCTGTT 600 

TTCTGTATTC GATGAGAATC AAAGCTGGTA CCTCGCAGAG AATATTCAGC GCTTCCTCCC 66 0 

CAATCCGGAT GGATTACAGC CCCAGGATCC AGAGTT CCAA GCTTCTAACA TCATGCACAG 72 0 
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CATCAATGGC TATGTTTTTG ATAGCTTGCA GCTGTCGGTT TGTTTGCACG AGGTGGCATA 780 

CTGGTACATT CTAAGTGTTG GAGCACAGAC GGACTTCCTC TCCGTCTTCT TCTCTGGCTA 840 

CACCTTCAAA CACAAAATGG TCTATGAAGA CACACTCACC CTGTTCCCCT T CTCAGGAG A 900 

AACGGTCTTC ATGTCAATGG AAAACC CAGG TCTCTGGGTC CTAGGGTGCC ACAACTCAGA 960 

CTTG CGGAAC AGAGGGATGA CAGCCTTACT GAAGGTGTAT AGTTGTGACA GGGACATTGG 102 0 

TGATTATTAT G AC AAC AC TT ATGAAGATAT TCCAGGCTTC TTGCTGAGTG GAAAGAATGT 10 8 0 

CATTGAACCC AGAAGCTTTG CCCAGAATTC AAGACCCCCT AGTGCGAGCA 113 0 
( 2 ) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 368 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porcine 
(F) TISSUE TYPE: spleen 

(ix) FEATURE: 

(A) NAME / KEY : Protein 

(B) LOCATION: 1 . . 368 

(D) OTHER INFORMATION : /note= "Predicted amino acid 
sequence of porcine factor VIII A2 domain , defined as residues 
homologous to human factor VIII, amino acids 3 73-740. Residues 
1-4 are from known porcine amino acid sequence." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Ser Val Ala Lys Lys His Pro Lys Thr Trp Val His Tyr lie Ser Ala 
1 5 10 15 

Glu Glu Glu Asp Trp Asp Tyr Ala Pro Ala Val Pro Ser .Pro Ser Asp 
20 25 30 

Arg Ser Tyr Lys Ser Leu Tyr Leu Asn Ser Gly Pro Gin Arg lie Gly 
35 40 45 

Arg Lys Tyr Lys Lys Ala Arg Phe Val Ala Tyr Thr Asp Val Thr Phe 
50 55 60 . 
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Lys Thr Arg Lys Ala 
65 

Leu Tyr Gly Glu Val 
85 

Ala Ser Arg Pro Tyr 
100 

Ala Leu His Pro Gly 
115 

Met Pro lie Leu Pro 
130 

Val Glu Asp Gly Pro 
145 

Tyr Ser Ser Ser lie 
165 

Gly Pro Leu Leu lie 
180 

Gin Met Met Ser Asp 
195 

Glu Asn Gin Ser Trp 
210 



lie Pro Tyr Glu Ser Gly 
70 75 

Gly Asp Thr Leu Leu lie 
90 

Asn lie Tyr Pro His Gly 
105 

Arg Leu Leu Lys Gly Trp 
120 

Gly Glu Thr Phe Lys Tyr 
135 

Thr Lys Ser Asp Pro Arg 
150 155 

Asn Leu Glu Lys Asp Leu 
170 

Cys Tyr Lys Glu Ser Val 
185 

Lys Arg Asn Val lie Leu 
200 

Tyr Leu Ala Glu Asn lie 
215 



lie Leu Gly Pro Leu 
80 

lie Phe Lys Asn Lys 
95 

lie Thr Asp Val Ser 
110 

Lys His. Leu Lys Asp 
125 

Lys Trp Thr Val Thr 
140 

Cys Leu Thr Arg Tyr 
160 

Ala Ser Gly Leu lie 
175 

Asp Gin Arg Gly Asn 
190 

Phe Ser Val Phe Asp 
205 

Gin Arg Phe Leu Pro 
220 



Asn Pro Asp Gly Leu Gin Pro Gin Asp Pro Glu Phe Gin Ala Ser Asn 
225 230 235 240 

lie Met His Ser lie Asn Gly Tyr Val Phe Asp Ser Leu Gin. Leu Ser 
245 250 255 



Val Cys Leu His Glu Val Ala Tyr Trp Tyr lie Leu Ser Val Gly Ala 
260 265 270 

Gin Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr Thr Phe Lys His 
275 280 285 • 

Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro Phe Ser Gly Glu 
290 295 300 

Thr Val Phe Met Ser Met Glu Asn Pro Gly Leu Trp Val Leu Gly Cys 
305 310 315 320 

His Asn Ser Asp Leu Arg Asn Arg Gly Met Thr Ala- Leu Leu Lys Val 
325 330 335 

Tyr Ser Cys Asp Arg Asp lie Gly Asp Tyr Tyr Asp Asn Thr Tyr Glu 

■■ : 340 ' 345 350 

- Asp lie Pro Gly. Phe Leu Leu Ser Gly Lys Asn Val lie Glu Pro Arg 
355 360 365 

18 



BNSDOCID; <WO. 



9946274A1 J_> 



WO 99/46274 



PCT/US99/05193 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: CDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MUS musculus 



(ix) FEATURE: 

(A) NAME /KEY : repeat_unit 

(B) LOCATION: 1..407 

(D) OTHER INFORMATION: /rpt_type= "terminal" 
/note= " 5' UTR" 



( ix ) FEATURE : 

(A) NAME /KEY : misc__f eature 

(B) LOCATION: 7471.. 7476 

(D) OTHER INFORMATION: /function^ "polyA signal" 



(ix) FEATURE: 

(A) NAME /KEY : repeat_unit 

(B) LOCATION: 7368.. 7493 

(D) OTHER INFORMATION: /rpt_type= "terminal" 
/note- w 3' UTR" 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION : 408 . . 7367 

(D) OTHER INFORMATION: /product= "coagulation factor VII I" 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: Elder, F. 

Lakich, D . 
Gitschier, J. 

(B) TITLE: Sequence of the murine Factor VIII cDNA 

(C) JOURNAL: Genomics 

(D) VOLUME: 16 

(F) PAGES: 374-379 

(G) DATE: 1993 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TCTAGAGTTT CTTTGCTACA GGTACCAAGG AACAGTCTTT TAGAATAGGC TAGGAATTTA 
AAT AC AC CTG AACGCCCCTC CTCAGTATTC TGTTCCTTTT CTTAAGGATT CAAACTTGTT 
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AGGATGCACC 


C AG CAG G AAA 


TGGGTTAAGC 


V- X x AVjy x 




tattccagtt 


180 


TTCGTGTGCC 


TGCTTCCTAC 


TACCCAAAAG 


GAArTPA ATPP 


X X LAXjAI Lib 


ttttgtgcta 


240 


ATGGTACTTT 


C ACT CACAGT 


AGATAAACTT 


PPAfiAAfiATf 


f ,r P/*' ,r rV/"* A TV TV TV 


tatttaggac 


300 


TTTTTACTAA 


ATCATTACAT 


TTCTTTTTGT 


TPTTAAAAGP 


iAAAol 1 AIT 


ttagagaaga 


~k & n 

•J O \J 


GTTAAATTTT 


CATTTCTTTA 


G T TGAACAT T 


TTPTAf?TA Z1T 

X X L X X J\J-i. X 


A TV TV Tl /~t /~i ^ 

AAAACjC cat g 


CAAA T AG PA P 




TCTTCGCTTG 


CTTCTTTCTG 


AGCCTTTTPA 

"»jw. x x x x v^ri 


■ri.X X X L X l l 


1AGTGCCATC 


AGAAGATAPT 


4 8 0 


ACCTTGGTGC 


AGTGGAATTG 


TCCTGGAACT 


■fiirti 1 LAVj A\j 


1 bATCTGCTC 


AGTGTGCTGC 


54 0 


AT ACAGACT C 


AAGATTTCTT 


C C T AG AATGT 


PAAPSTPTTT 
LrtALfi. X L x J. X 


x LLATTCAAC 


ACCTCCATCA 


600 


TGTATAAAAA 


GACTGTGTTT 


GTAGAGTACA 


AGnAPPAfSPT 


111 LftALATT 


GCCAAGCCCA 


660 


GGCCACCCTG 


GATGGGTTTG 


CTAGGTCCTA 


PPATTTf3f3AP 


1 bAbb 1 1 LAl 


GACACAGTGG 


720 


TCATTACACT 


TAAAAACATG 


GCTTCTPATP 


L X VJ X V — M.L* 1L1 


1 lA 1 bLTbTT 


GGTGTGTCCT 


780 


ACTGGAAAGC 


TTCTGAGGGA 


GATGAATATG 


AAfiATPAniir 


AAb l LAAAI G 


GAGAAGGAAG 


840 


ATGATAAAGT 


TTTCCCTGGT 


GAAAGTCATA 


PTTATCTTTP 

L X X X L7 XXX 


buAAb X ll. 1 tj 


AAAGAGAATG 


900 


GTCCAATGGC 


CTCTGACCCT 


CCATGTPTPA 


PTTAPTPZiTZi 

V— X X J-1L X L-ii X A 


1 A 1 (j 1 LTCAT 


GTGGATCTGG 


960 


TGAAAGATTT 


GAATTCAGGC 


CTCATTGGAG 


LXLl LtL X A\J X 


A r Pf r ""T l A A A A A 

AJ.bI AAAbAA 


GGCAGTCTCT 


1020 


CCAAAGAAAG 


AACACAGATG 


TTGTACCAAT 


1 lui X IjL X 


1111 LjC I GTA 


TTTGATGAAG 


1080 


GGAAGAGCTG 


GCACTCAGAA 




V— X XAXAlAlA 


bib 1ATGGAT 


TCTGCATCTG 


1140 


CTAGAGACTG 


GCCTAAAATG 


P AP AP AGT P A 


rtX vjVjl. liilur X 


A A A O A /^t/'~««T , /~i«T« 

AAAUAGGTCT 


CTTCCAGGTC 


1200 


TGATTGGATG 


CCATAGGAAA 


TCAGTPTAPT 


PGP A PTJTf^ A T* 


X ^-JvjAA 1 bjbjVaL 


ACCACTCCTG ' 


1260 


AAATACACTC . 


AATATTCCTC 


GAAGGTPAPA 


x X XX X XV3X 


bAbbAAb LAL 


CGTCAAGCTT 


1320 


CATTGGAGAT 


ATCACCAATA 


APTTTPPTTA 

XXX V— L X X 


V— X \JlL X LAAAL 


Ab- 1L1 1 GAT A 


GATCTTGGGC 


1380 


AGTTCCTACT 


ATTTTGTCAT 


ATCTCTTPPP 


ATAAAPATna 


1 LiVjxLA 1 bbAA 


GCTTATGTCA 


1440 


AAGTAGATAG 


CTGCCCTGAG 


G AATP PP A AT 

VJrVfi ALL Unn X 




AAA T 1 A A »Ti A A rri 

AAA 1 AA I AAT 


GAGGAAATGG 


1500 


AAGATTATGA 


TGATGATCTT 


T ATT C AGAAA 


•x WUrt X X o X X 


lALAI IbbAi 


TATGACAGCT . 


1560 


CTCCTTTTAT 


CCAAATTCGG 


TCGGTTGPTA 


AAAACTaPPP 


T* A AAA HTTPP 
1 AAAAL, 1 1 Vjbj 


ATACATTATA 


1620 


TTTCTGCTGA 


GGAGGAAGAC 


TGGGACTATG 


PAPPTTPAflT 
UAL. V — XX Lno X 


lLLl ALL 1 lLj 


GATAATGGAA 


1680 


GTTATAAAAG. CCAGTATCTG 


AGCAATGGTC 


CTCATCGGAT 


tggtaggaaa 


TATAAAAAAG 


1740 


TCAGATTTAT ; 


AGCATACACA - 


GATGAAACCT 


TTAAGACTCG 


tgaaactatt 


CAGCATGAAT 


1800 
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CAGGACTCTT GGG AC CTTT A CTTTATGGAG AAGTTGGAGA CACACTGTTG ATTATTTTTA 186 0 

AGAATCAAGC AAGCCGACCA TATAACATTT ACCCTCATGG AATCACTGAT GTCAGTCCTC 192 0 

TACATGCAAG GAGATTGCCA AGAGGTATAA AGCACGTGAA GGATTTGCCA ATTCATCCAG 198 0 

GAG AG AT ATT CAAGTACAAG TGGACAGTTA CAGTAGAAGA TGGACCAACT AAATCAGATC 204 0 

CACGGTGCCT GACCCGCTAT TATTCAAGTT TCATTAACCC TGAGAGAGAT CTAGCTTCAG 210 0 

GACTGATTGG CCCTCTTCTC ATCTGCTACA AAGAATCTGT AGATCAAAGG GGAAACCAGA 2160 

TGATGTCAGA CAAAAGAAAT GTCATCCTGT TTTCTATATT TGATGAGAAC CAAAGCTGGT 222 0 

ACATCACAGA GAACATGCAA CGCTTCCTCC CCAATGCAGC TAAAACACAG CCCCAGGACC 228 0 

CTGGGTTCCA GGCCTCCAAC ATCATGCACA GCATCAATGG CTATGTTTTT GATAGCTTGG 234 0 

AGTTGACAGT TTGTTTGCAT GAGGTGGCAT ACTGGCACAT TCTCAGTGTT GGAGCACAGA 24 00 

CAGACTTCTT ATCTATCTTC TTCTCTGGAT ATACTTTCAA ACACAAAATG GTCTATGAAG 246 0 

ATACACTTAC CCTGTTCCCA TTCTCAGGAG AAAC TGTCTT TATGTCGATG GAAAACCCAG 2 52 0 

GTCTATGGGT CTTGGGGTGT CATAATTCAG ACTTTCGGAA GAGAGGTATG ACAGCATTGC 2580 

— TGAAAGTTTC TAGTTGTGAC AAGAGCACTA GTGATTATTA TGAAGAAATA TATGAAGATA 264 0 

: - ~TTCCAACACA GTTGGTGAAT GAGAACAATG TCATTGATCC CAGAAGCTTC TTCCAGAATA 2 700 

: CAAATCATCC TAATACTAGG -AAAAAGAAAT TCAAAGATTC CACAATTCCA AAAAATGATA 276 0 

TGGAGAAGAT TGAGCCTCAG TTTGAAGAGA TAGCAGAGAT GCTTAAAGTA CAGAGTGTCT 2820 

• CAGTTAGTGA CATGTTGATG CTCTTGGGAC AGAGTCATCC TACTCCACAT GGCTTATTTT 2 880 

TATCAGATGG CCAAGAAGCC ATCTATGAGG CTATTCATGA TGATCATTCA CCAAATGCAA 2 940 

TAGACAGCAA TGAAGGCCCA TCTAAAGTGA CCCAACTCAG GCCAGAATCC CATCACAGTG 3 000 

AGAAAATAGT ATTTACTCCT CAGCCCGGCC TCCAGTTAAG ATCCAATAAA AGTTTGGAGA 306 0 

CAACTATAGA AGTAAAGTGG AAGAAACTTG GTTTGCAAGT TTCTAGTTTG CCAAGTAATC 3120 

TAATGACTAC AACAATTCTG TCAGACAATT TGAAAGCAAC TTTTGAAAAG ACAGATTCTT 3180 

CAGGATTTCC AGATATGCCA GTTCACTCTA GTAGTAAATT AAGTACTACT GCATTTGGTA 324 0 

AGAAAGCATA * TTCCCTTGTT * GGGTCTCATG TACCTTTAAA CGCGAGTGAA GAAAATAGTG 3 300 

ATTCCAACAT ATTGGATTCA ACTTTAATGT ATAGTCAAGA AAGTTTACCA AGAGATAATA 3360 
TATTATCAAT AGAGAATGAT AGATTACTCA GAGAGAAGAG GTTTCATGGA ATTGCTTTAT . 342 0 
TGACCAAAGA TAATACTTTA TTCAAAGACA ATGTCTCCTT AATGAAAACA AACAAAACAT ' 3480 
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ATAATCATTC 


AACAACTAAT 


GAAAAACTAC 


ACACTGAGAG 


CCCAACATCA 


ATTGAGAATA 


3540 


GTACAACAGA 


CTTGCAAGAT 


GCCATATTAA 


AGGTCAATAG 


TGAGATTCAA 


GAAGTAACAG 


3600 


CTTTGATTCA 


TGATGGAACA 


CTTTTAGGCA 


AAAATTCTAC 


ATATTTGAGA 


CTAAACCATA 


3660 


TGCTAAATAG 


AACTACCTCA 


ACAAAAAATA 


AAGACATATT 


TCATAGAAAA 


■ GATGAAGATC 


3720 


CTATTCCACA AGATGAAGAG 


AATACAATCA 


TGCCATTTTC 


CAAGATGTTG 


TTCTTGTCAG 


3780 


AATCTTCAAA 


TTGGTTTAAA 


AAGACCAATG 


GAAATAATTC 


CTTGAACTCT 


GAGCAAGAAC 


3840 


ATAGTG C AAA 


- GCAATTAGTA TATTTAATGT 


TTAAAAAATA 


TGTAAAAAAT 


CAAAGTTTCT 


3900 


TGTCAGAGAA 


AAATAAAGTC 


^i.v_^l\J x nUn/iL 


AGGATGGATT 


TACAAAGAAC 


ATAGGACTTA 


3960 


AAGACATGGC 


TTTTCCACAT 


A a T A Tf2 a rz c a 


TATTTCTTAC 


CACTTTGTCT 


AACGTACATG 


4020 


AAAATGGTAG 


GCACAATCAA 


GAAAAAAATA 


TTCAGGAAGA 


GATAGAGAAG 


GAAGCACTAA 


4080 


TTGAAGAGAA 


AGTAGTTTTG 


CCCCACiClTClC* 

- ^— V_ V — f\\J\J X \JJ<_ 


ACGAAGCAAC 


TGGCTCTAAG 


AATTTCTTGA 


4140 


AAGACATATT 


GATACTAGGC 


APTAfJrtPAA A 


ATATAAGTTT 


ATATGA-.GTA 


CATGTACCAG 


4200 


TACTTCAAAA 


CATCACATCA 


nX a4-U-LV_J-U-LX X 


CAACAAATAC 


AGTACAGATT 


CACATGGAGC 


4260 


ATTTCTTTAA 


AAGAAGGAAG 


rjara Ann a a a 


CAAATTCAGA 


AGGCTTGGTA . 


AATAAAACCA 


4320 


GAGAAATGGT. 


AAAAAACTAT 


CCAACrfAGA 


AGAATATTAC - 


TACTCAACGT 


AGTAAACGGG 


4380 


CTTTGGGACA 


ATTCAGACTG 


TC A A PTf A A T 


GGCTTAAAAC 


CATAAACTGT 


T CAACAC AG T 


4440 


GTATCATTAA ACAGATAGAC 




AAATGAAAAA 


GTTCATTACT 


AAATCTTCCT 


4500 


TATCAGATTC 


TTCTGTGATT 




CTCAGACAAA 


TAGTTCTGAC 


TCACACATTG 


4560 


TAAAAACATC 


AGCATTTCCA 


CCAATAGATC 


TCAAAAGGAG 


TC CATTCCAA 


AACAAATTTT 


4620 


CTCATGTTCA ' 


AGCATCATCC 


TACATTTATG 


ACTTTAAGAC 


AAAAAGTTCA 


AGAATTCAAG 


4680 


AAAGCAATAA 


TTTCTTAAAA 


GAAACCAAAA 


TAAATAACCC 


TTCTTTAGCC 


ATTCTACCAT 


4740 


GGAATATGTT 


CATAGATCAA 


GGAAAATTTA 


CCTCCCCAGG 


GAAAAGTAAC 


ACAAACTCAG 


4800 


TCACATATAA 


GAAACGTGAG 


AACATTATTT 


TCTTGAAACC 


AACTTTGCCT 


GAAGAATCTG 


4860 


GCAAAATTGA 


ATTGGTTCCT 


CAAGTTTCCA 


TTCAAGAGGA 


AGAAATTTTA 


CCTACAGAAA 


4920 


CTAGCCATGG 


ATCTCCTGGA 


CACTTGAATC 


TCATGAAAGA 


GGTCTTTCTT 


CAGAAAATAC 


4980 


AGGGG CCTAC 


TAAATGGAAT 


AAAGCAAAGA 


GGCATGGAGA 


AAGTATAAAA . 


GGTAAAACAG 


5040 


AGAGCTCTAA 


AAATACTCGC 


TCAAAACTGC 


TAAATCATCA 


TGCTTGGGAT 


TATCATTATG , 


5100 


CTGCACAGAT ACCAAAAGAT 


ATGTGGAAAT 


CCAAAGAGAA 


GTCACCAGAA. 


ATTATATCCA 


5160 
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TTAAGCAAGA GGACACCATT TTGTCTCTGA GGCCTCATGG AAACAGTCAT TCAATAGGGG 5220 

CAAATGAGAA ACAAAATTGG CCTCAAAGAG AAACCACTTG GGTAAAGCAA GGCCAAACTC 5280 

AAAGGACATG CTCTCAAATC CCACCAGTGT TGAAACGACA TCAAAGGGAA CTTAGTGCTT 534 0 

TTCAATCAGA ACAAGAAGCA ACTGACTATG ATGATGCCAT C AC C ATTGAA ACAATCGAGG 54 00 

ATTTTGACAT TTACAGTGAG GACATAAAGC AAGGTCCCCG CAG CTTTCAA CAGAAAACAA 5460 

GGCACTATTT TATTGCAGCT GTGGAACGAC TCTGGGACTA TGGGATGAGT AC ATCT CATG 5 52 0 

TTCTACGAAA TAGGTATCAA AGTGACAATG TACCTCAGTT CAAGAAAGTA GTTTTC CAGG 55 8 0 

AATTTACTGA TGGCTCCTTT AGTCAGCCCT TATATCGTGG AGAATTAAAT GAACAC CTGG 56 4 0 

GGTTGTTGGG CCCATATATA AG AG C AG AAG TTGAAGACAA CATTATGGTA ACTTTCAAAA 57 00 

ACCAGGCCTC CCGTCCCTAC TCCTTCTATT CTAGCCTCAT TTCTTATAAA GAAGATCAGA 5 760 

GAGGAGAAGA ACCTAGAAGA AACTTTGTCA AGCCTAATGA AACCAAAATT TATTTTTGGA 5820 

AAGTACAACA TCATATGGCA CCCACAGAAG ATGAGTTTGA CTGCAAGGCC TGGGCTTATT 58 80 

TCTCTGATGT TGATCTTGAA AGAGATATGC ACTCGGGATT AATTGGACCC CTTCTGATTT 5940 

GCCACGCGAA CACACTGAAT CCTGCTCATG GGAGACAAGT GTCAGTACAG GAATTTGCTC 6000 

TGCTTTTCAC TATCTTTGAT GAGACCAAGA GCTGGTACTT CACTGAAAAC GTGAAAAGGA 6060 

ACTGCAAGAC ACCCTGCAAT TTCCAGATGG AAGACCCCAC TTTGAAAGAG AATTATCGCT 612 0 

TC C ATGCAAT CAATGGTTAT GTAATGGATA CCCTACCAGG CTTAGTAATG GCTCAAGATC v 6180 

AAAGGATTCG ATGGTATCTT CTCAGCATGG GCAACAATGA GAACATCCAA TCTATTCATT - 624 0 

TCAGTGGACA . TGTTTT.CACT GTACGGAAAA AAGAGGAGTA TAAAATGGCA GTGTACAACC ,,63 00 

TCTACCCAGG TGTTTTTGAG ACTCTGGAAA TGATACCATC CAGAGCTGGA ATATGGCGAG 63 6 0 

TAGAATGCCT TATTGGCGAG CACTTACAGG CTGGGATGAG CACTCTTTTT CTGGTGTACA 642 0 

GCAAGCAGTG TCAGATTCCT CTTGGAATGG CTTCTGGAAG CATCCGTGAT TTCCAGATTA 6480 

CAGCTTCAGG ACATTATGGA CAGTGGGCCC CAAACCTGGC AAGACTTCAT TATTCCGGAT - 654 0 

CAATCAATGC CTGGAGTACC - AAGGAGCCCT TTTCTTGGAT CAAGGTAGAT CTGTTGGCAC 6 600 

CAATGATTGT TCATGGCATC AAGACTCAGG GTGCTCGTCA GAAATTTTCC AGCCTTTATA 6660 

TCTCTCAATT TATCATCATG TATAGCCTGG ATGGGAAGAA GTGGCTGAGT TATCAAGGAA 672 0 

ATTCCACTGG AACCTTAATG GTTTTCTTTG G C AATGTGG A CTCATCTGGG ATT AAG C ATA 678 0 

ATAGTTTTAA TCCTCCAATT. ATTGCTCGAT ATATCCGTTT GCACCCCACT CATTCTAGCA 684 0 
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TCCGTAGTAC TCTTCGCATG GAGTTGATGG GCTGTGATTT AAACAGTTGC AGCATACCAT 6900 

TGGGAATGGA AAGTAAAGTA ATATCAGATA CACAAATCAC TGCCTCATCC TACTTCACCA 6960 

ACATGTTTGC TACTTGGTCT CCTTCACAAG CTCGACTTCA CCTCCAGGGA AGGACTAATG 7020 

CCTGGCGACC TCAGGTGAAT GATCCAAAAC AATGGTTGCA. AGTGGACTTA CAAAAGACAA 7080 

TGAAAGTCAC TGGAATAATA ACCCAGGGAG TGAAATCTCT CTTTACCAGC ATGTTTGTGA 714 0 

AAGAGTTCCT TATTTCCAGC AGTCAAGATG GC CAT CACTG GACTCAAATT TTATACAATG 72 00 

. GCAAGGTAAA GGTTTTTCAG GGGAATCAGG ACTCATCCAC ACCTATGATG AATTCTCTAG 72 6 0 

ACCCACCATT ACTCACTCGC TATCTTCGAA TTCACCCCCA GATCTGGGAG CAC CAAATTG 732 0 

CTCTGAGGCT TGAGATTCTA GGATGTGAGG CCCAGCAGCA ATACTGAGGT AGCCTCTGCA 73 8 0 

TCACCTGCTT ATTCCCCTTC CTCAGCTCAA AGATTGTCTT AATGTTTTAT TGCTGTGAAG 744 0 

AGACACTATG ACCATGGCAA CTCTTTATAA AATAAAGCAT TTAATCAGGG CTT 74 93 
(2) INFORMATION FOR SEQ ID NO : 6 :. 

• (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2319 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE. TYPE : protein. 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal , . . 

(vi) ORIGINAL SOURCE': 

(A) ORGANISM: Mus musculus 



(x) PUBLICATION INFORMATION.: 

(A) AUTHORS: Elder, F . 

Lakich, D. 
Gitschier , J. 

(B) TITLE: Sequence of the Murine Factor VIII cDNA 

( C ) JOURNAL : Genomi c S 

• — • (D) VOLUME: 16. - .. . ■„ _ / . ^ 

(F) PAGES: 374-379 

(G) DATE: 1993 

(K) RELEVANT RESIDUES IN SEQ ID NO : 6 : FROM 1 TO 2319 



(xi) .SEQUENCE .DESCRIPTION: SEQ ID NO : 6 : 
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Met Gin lie Ala Leu Phe Ala Cys Phe Phe Leu Ser Leu Phe Asn Phe 
1 5 10 15 

Cys Ser Ser Ala lie Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser 
20 25 30 

Trp Asn Tyr lie Gin Ser Asp Leu Leu Ser Val Leu His Thr Asp Ser 
35 40 45 

Arg Phe Leu Pro Arg Met Ser Thr Ser Phe Pro Phe Asn Thr Ser lie 
50 55 60 

Met Tyr Lys Lys Thr Val Phe Val Glu Tyr Lys Asp Gin Leu Phe Asn 
65 70 75 80 

lie Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr lie 
85 90 95 

Trp Thr Glu Val His Asp Thr Val Val lie Thr Leu Lys Asn Met Ala 
100 105 ^ 110 

Ser His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala 
115 120 125 

Ser Glu Gly Asp Glu Tyr Glu Asp Gin Thr Ser Gin Met Glu Lys Glu 
130 135 140 

Asp Asp Lys Val Phe Pro Gly Glu Ser His Thr Tyr Val Trp Gin Val 
145 150 155 160 

Leu Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Pro Cys Leu Thr Tyr 
165 170 175 

Ser Tyr Met Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu 
180 185 190 

He Gly Ala Leu Leu Val Cys Lys Glu Gly Ser Leu Ser Lys Glu Arg 
195 200 205 

Thr Gin Met Leu Tyr Gin Phe Val Leu Leu Phe Ala Val Phe Asp Glu 
210 215 220 

Gly Lys Ser Trp His Ser Glu Thr Asn Asp Ser Tyr Thr Gin Ser Met 
225 230 235 240 

Asp Ser Ala Ser Ala Arg Asp Trp Pro Lys Met His Thr Val Asn Gly 
245 250 255 

Tyr Val Asn Arg Ser Leu Pro Gly Leu He Gly Cys His Arg Lys Ser 
260 265 270 

Val Tyr Trp His Val He Gly Met Gly Thr Thr Pro Glu He His Ser 
275 280 285 

He Phe Leu Glu Gly His Thr Phe Phe Val Arg Asn His Arg Gin Ala 
290 295 300 
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Ser Leu Glu lie Ser Pro lie Thr Phe Leu Thr Ala Gin Thr Leu Leu 
305 . 310 315 320 

lie Asp Leu Gly Gin Phe Leu Leu Phe Cys His lie Ser Ser His Lys 
325 330 335 

His Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu 
340 345 350 

Ser Gin Trp Gin Lys Lys Asn Asn Asn Glu Glu Met Glu Asp Tyr Asp 
355 360 365 

Asp Asp Leu Tyr Ser Glu Met Asp Met Phe Thr Leu Asp Tyr Asp Ser 
370 375 380 

Ser Pro Phe lie Gin lie Arg Ser Val Ala Lys Lys Tyr Pro Lys Thr 
385 . 390 395 400 

Trp lie His Tyr lie Ser Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro 
405 410 415 

Ser Val Pro - Thr Ser Asp Asn Gly Ser Tyr Lys Ser Gin Tyr Leu Ser 
420 425 430 

Asn Gly Pro His Arg lie Gly Arg Lys Tyr Lys Lys Val Arg Phe lie 
435 440 445 

Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Thr lie Gin His Glu 
450 455 460 

Ser Gly Leu Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu 
465 470 475 480 

Leu lie lie Phe Lys Asn Gin Ala Ser Arg Pro Tyr Asn lie Tyr Pro 
485 490 495 

His Gly lie Thr Asp Val Ser Pro Leu His Ala Arg Arg Leu Pro Arg 
500 505 510 

Gly lie Lys His Val Lys Asp Leu Pro lie His Pro Gly Glu lie Phe 
515 520 525 

Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp 
530 535 540 

Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe lie Asn Pro Glu Arg 
545 550 555 560 

Asp Leu Ala. Ser. Gly Leu lie Gly Pro Leu Leu lie Cys -Tyr Lys Glu 
565 570 575 

Ser Val Asp Gin Arg Gly Asn Gin Met Met Ser Asp Lys Arg Asn Val 
580 585 590 

lie Leu Phe Ser lie Phe Asp Glu Asn Gin . Ser Trp Tyr lie Thr Glu 
595 600 605 
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Asn Met Gin Arg Phe Leu Pro Asn Ala Ala Lys Thr Gin Pro Gin Asp 
610 615 620 

Pro Gly Phe Gin Ala Ser Asn lie Met His Ser lie Asn Gly Tyr Val 
625 630 635 640 

Phe Asp Ser Leu Glu Leu Thr Val Cys Leu His Glu* Val Ala Tyr Trp 
645 650 655 

His lie Leu Ser Val Gly Ala Gin Thr Asp Phe Leu Ser lie Phe Phe 
660 665 670 

Ser Gly. Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr 
675 680 685 

Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro 
690 695 700 

Gly Leu Trp Val Leu Gly Cys His Asn Ser Asp Phe Arg Lys Arg Gly 
705 710 715 720 

Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Ser Thr Ser Asp 
725 730 735 

Tyr Tyr Glu Glu lie Tyr Glu Asp lie Pro Thr Gin Leu Val Asn Glu 
740 745 750 

Asn Asn Val lie Asp Pro Arg Ser Phe Phe Gin Asn Thr Asn His Pro 
755 760 765 

Asn Thr Arg Lys Lys Lys Phe Lys. Asp Ser Thr lie Pro Lys Asn Asp 
770 775 780 

Met Glu Lys lie Glu Pro Gin Phe Glu Glu lie Ala Glu Met Leu Lys 
785 790 795 800 

Val Gin Ser Val Ser Val Ser Asp Met Leu Met Leu Leu Gly Gin Ser 
805 810 815 

His Pro Thr Pro His Gly Leu Phe Leu Ser Asp Gly Gin Glu Ala lie 
820 825 830 

Tyr Glu Ala lie His Asp Asp His Ser Pro Asn Ala lie Asp Ser Asn 
835 840 845 

Glu Gly Pro Ser Lys Val Thr Gin Leu Arg Pro Glu Ser His His Ser 
850 855 860 

Glu Lys lie Val Phe Thr Pro Gin Pro Gly Leu Gin Leu Arg Ser Asn 
865 870 875 880 

Lys Ser Leu Glu Thr Thr lie Glu Val Lys Trp Lys Lys Leu Gly Leu 
885 890 895 

Gin Val Ser Ser Leu Pro Ser Asn Leu Met Thr- Thr Thr lie. Leu Ser 
900 905 910 
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Asp Asn Leu Lys Ala Thr Phe Glu Lys Thr Asp Ser Ser- Gly Phe Pro 
915 920 925 

Asp Met Pro Val His Ser Ser Ser Lys Leu Ser- Thr Thr Ala Phe Gly 
930 935 940 

Lys Lys Ala Tyr Ser Leu Val Gly Ser His Val Pro Leu Asn Ala Ser 
945 950 955 960 

Glu Glu Asn Ser Asp Ser Asn lie Leu Asp Ser Thr Leu Met Tyr Ser 
965 970 975 

Gin Glu Ser Leu Pro Arg Asp Asn lie Leu Ser lie Glu Asn Asp Arg 
980 985 990 

Leu Leu Arg Glu Lys Arg Phe His Gly lie Ala Leu Leu Thr Lys Asp 
995 1000 1005 

Asn Thr Leu Phe Lys Asp Asn Val Ser Leu Met Lys Thr Asn Lys Thr 
1010 1015 1020 

Tyr Asn His Ser Thr Thr Asn Glu Lys Leu His Thr Glu Ser Pro Thr 
1025 1030 1035 1040 

Ser lie Glu Asn Ser Thr Thr Asp Leu Gin Asp Ala lie Leu Lys Val 
1045 1050 1055 

Asn Ser Glu lie Gin Glu Val Thr Ala Leu lie His Asp Gly Thr Leu 
1060 1065 1070 

Leu Gly Lys Asn Ser Thr Tyr Leu Arg Leu Asn His Met Leu Asn Arg 
1075 1080 1085 

Thr Thr Ser Thr Lys Asn Lys Asp lie Phe His Arg Lys Asp Glu Asp 
1090 1095 1100 

Pro lie Pro Gin Asp Glu Glu Asn Thr lie Met Pro Phe Ser Lys Met 
1105 1110 1115 1120 

Leu Phe Leu Ser Glu Ser Ser Asn Trp Phe Lys Lys Thr Asn Gly Asn 
1125 1130 1135 

Asn Ser Leu Asn Ser Glu Gin Glu His Ser Pro Lys Gin Leu Val Tyr 
1140 1145 \ 1150 

Leu Met Phe Lys Lys Tyr Val Lys Asn Gin Ser Phe Leu Ser Glu Lys 
1155 1160 1165 

Asn Lys Val Thr Val Glu Gin -Asp Gly Phe Thr Lys Asn lie Gly Leu 
1170 1175 1180 

Lys Asp Met Ala- Phe Pro His Asn Met Ser lie Phe Leu Thr Thr Leu 
1185 1190 - . 1195 1200 

Ser Asn Val-His-Glu Asn Gly Arg' His Asn Gin Glu -Lys Asn lie Gin 
1205 1210 1215 

28 



BNSDOCID: <WO 994c£7Ml J_> 



WO 99/46274 



PCT/US99/05193 



Glu Glu lie Glu Lys Glu Ala Leu lie Glu Glu Lys Val Val Leu Pro 
1220 1225 1230 

Gin Val His Glu Ala Thr Gly Ser Lys Asn Phe Leu Lys Asp lie Leu 
1235 1240 1245 

lie Leu Gly Thr Arg Gin Asn lie Ser Leu Tyr Glu Val His Val Pro 
1250 1255 1260 

Val Leu Gin Asn lie Thr Ser lie Asn Asn Ser Thr Asn Thr Val Gin 
1265 1270 1275 1280 

lie His Met Glu His Phe Phe Lys Arg Arg Lys Asp Lys Glu Thr Asn 
1285 1290 1295 

Ser Glu Gly Leu Val Asn Lys Thr Arg Glu Met Val Lys Asn Tyr Pro 
1300 1305 1310 

Ser Gin Lys Asn lie Thr Thr Gin Arg Ser Lys Arg Ala Leu Gly Gin 
1315 1320 1325 

Phe Arg Leu Ser Thr Gin Trp Leu Lys Thr lie Asn Cys Ser Thr Gin 
1330 1335 1340 

Cys lie lie Lys Gin lie Asp His Ser Lys Glu Met Lys Lys Phe He 
1345 1350 1355 1360 

Thr Lys Ser Ser Leu Ser Asp Ser Ser Val He Lys Ser Thr Thr .Gin 
1365 1370 1375 

Thr Asn Ser Ser Asp Ser His lie Val Lys Thr Ser Ala Phe Pro Pro 
1380 1385 1390 

He Asp Leu Lys Arg Ser Pro Phe Gin Asn Lys Phe Ser His Val Gin 
1395 1400 1405 

Ala Ser Ser Tyr lie Tyr Asp Phe Lys Thr Lys Ser Ser Arg He Gin 
1410 1415 1420 

Glu Ser Asn Asn Phe Leu Lys Glu Thr Lys lie Asn Asn Pro Ser Leu 
1425 1430 1435 1440 

Ala He Leu Pro Trp Asn Met Phe He Asp Gin Gly Lys Phe Thr Ser 
1445 1450 1455 

Pro Gly Lys Ser Asn Thr Asn Ser Val Thr Tyr Lys Lys Arg Glu Asn 
1460 1465 1470 

lie He Phe Leu Lys Pro Thr Leu Pro Glu Glu Ser Gly Lys He Glu 
1475 1480 1485 

Leu Leu Pro Gin Val Ser He Gin Glu Glu Glu He Leu Pro Thr Glu 
1490 1495 . . 1500 

Thr Ser His Gly Ser Pro Gly His Leu Asn Leu Met Lys Glu. Val Phe 
1505 1510 1515 1520 
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Leu Gin Lys lie Gin Gly Pro Thr Lys Trp Asn Lys Ala Lys Arg His 
1525 1530 1535 

Gly Glu Ser He Lys Gly Lys Thr Glu Ser Ser . Lys Asn Thr Arg Ser 
1540 1545 1550 

Lys Leu Leu Asn His His Ala Trp Asp Tyr His Tyr Ala Ala Gin He 
1555 1560 15S5 

r- Pro Lys Asp Met Trp Lys Ser Lys Glu Lys Ser Pro Glu He lie Ser 
1 57 ° 1575 1580 

He Lys Gin Glu Asp Thr He Leu Ser Leu Arg Pro His Gly Asn Ser 
1585 1590 1595 1600 

- His Ser He Gly Ala Asn Glu Lys Gin Asn Trp Pro Gin Arg Glu Thr 

1605 i6io 1615 

Thr Trp Val Lys Gin Gly Gin Thr Gin Arg Thr Cys Ser Gin He Pro 
1^20 1625 1630 

Pro Val Leu Lys Arg His Gin Arg Glu Leu Ser Ala Phe Gin Ser Glu 
1635 1640 1645 

Gin Glu Ala Thr Asp Tyr Asp Asp Ala lie Thr He Glu Thr He Glu 
1650 1655 1660 

Asp Phe Asp He Tyr Ser Glu Asp He Lys Gin Gly Pro Arg Ser Phe 
1665 1670 1675 1680 

Gin Gin Lys Thr Arg His Tyr Phe He Ala Ala Val Glu Arg Leu. Trp 
1685 1690 1695 

Asp Tyr Gly Met Ser Thr Ser His. Val Leu Arg Asn Arg Tyr Gin Ser 
1700 1705 1710 

Asp Asn Val Pro Gin Phe Lys Lys Val Val Phe Gin Glu Phe Thr Asp 
1715 1720 1725 

Gly Ser Phe Ser Gin Pro Leu Tyr Arg Gly. Glu Leu Asn Glu His Leu 
1730 - 1735 1740. 

Gly Leu Leu Gly Pro Tyr He Arg Ala Glu Val Glu Asp Asn He Met 
1745 1750 1755 1760 

Val Thr Phe Lys Asn Gin Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser 
1765 1770 1775 

Leu He Ser Tyr Lys Glu Asp Gin Arg Gly Glu Glu. Pro Arg Arg Asn 
1780 1785 1790 

Phe Val Lys Pro Asn Glu Thr Lys He Tyr Phe Trp Lys Val Gin His 
\ 1795 1800 1805 



His Met Ala Pro Thr Glu Asp Glu Phe Asp ,Cys Lys, Ala Trp Ala Tyr 
1810 1815,^ . 1820 
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Phe Ser Asp Val Asp Leu Glu Arg Asp Met His Ser Gly Leu lie Gly 
1825 1830 1835 1840 

Pro Leu Leu lie Cys His Ala Asn Thr Leu Asn Pro Ala His Gly Arg 
1845 1850 1855 

Glri Val Ser Val Gin Glu Phe Ala Leu Leu Phe Thr lie Phe Asp Glu 
1860 1865 1870 

Thr Lys Ser Trp Tyr Phe Thr Glu Asn Val Lys Arg Asn Cys Lys Thr 
1875 1880 1885 

Pro Cys Asn Phe Gin Met Glu Asp Pro Thr Leu Lys Glu Asn Tyr Arg 
1890 1895 1900 

Phe His Ala lie Asn Gly Tyr Val Met Asp Thr Leu Pro Gly Leu Val 
1905 1910 1915 1920 

Met Ala Gin Asp Gin Arg lie Arg Trp Tyr Leu Leu Ser Met Gly Asn 
1925 1930 1935 

Asn Glu Asn lie Gin Ser lie His Phe Ser Gly His Val Phe Thr Val 
1940 1945 1950 

Arg Lys Lys Glu Glu Tyr Lys Met Ala Val Tyr Asn Leu Tyr Pro Gly 
1955 1950 1965 

Val Phe Glu Thr Leu Glu Met lie Pro Ser Arg Ala Gly lie Trp Arg 
1970 1975 1980 

Val Glu Cys Leu lie Gly Glu His Leu Gin Ala Gly Met Ser Thr Leu 
1985 1990 1995 2000 

Phe Leu Val Tyr Ser Lys Gin Cys Gin lie Pro Leu Gly Met Ala Ser 
2005 2010 2015 

Gly Ser lie Arg Asp Phe Gin lie Thr Ala Ser Gly His Tyr Gly Gin 
2020 2025 2030 

Trp Ala Pro Asn Leu Ala Arg Leu His Tyr Ser Gly Ser lie Asn Ala 
2035 2040 2045 

Trp Ser Thr Lys Glu Pro Phe Ser Trp He Lys Val Asp Leu Leu Ala 
2050 2055 2060 

Pro Met He Val His Gly He Lys Thr Gin Gly Ala Arg Gin Lys Phe 
2065 2070 2075 2080 

Ser Ser Leu Tyr He Ser Gin Phe He He Met Tyr Ser Leu Asp Gly 
2085 2090 2095 

Lys Lys Trp Leu Ser Tyr Gin Gly Asn Ser Thr Gly Thr Leu Met Val 
2100 2105 2110 

Phe Phe Gly Asn Val Asp Ser Ser Gly He Lys His Asn Ser Phe Asn 
2115 2120 2125 

31 



BNSDOCID: <WO 9946274A1 _l_> 



WO 99/46274 - _ PCT/US99/05193 

Pro Pro lie lie Ala Arg Tyr lie Arg Leu His Pro Thr His Ser Ser 
2130 2135 2140 

lie Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser 
2145 2150 2155 2160 

Cys Ser lie Pro Leu Gly Met Glu Ser Lys Val lie Ser Asp Thr Gin 
2165 2170 2175 

lie Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro 
2180 2185 2190 

Ser Gin Ala Arg Leu His Leu Gin Gly Arg Thr Asn Ala Trp Arg Pro 
2195 2200 2205 

Gin Val Asn Asp Pro Lys Gin Trp Leu Gin Val Asp Leu Gin Lys Thr 
2210 2215 2220 

Met Lys Val Thr Gly lie lie Thr Gin Gly Val Lys Ser Leu Phe Thr 
2225 2230 2235 2240 

Ser Met Phe Val Lys Glu Phe Leu He Ser Ser Ser Gin Asp Gly His 
2245 2250 2255 

His Trp Thr Gin He Leu Tyr Asn Gly Lys Val Lys Val Phe Gin Gly 
2260 2265 2270 

Asn Gin Asp Ser Ser Thr Pro Met Met Asn Ser Leu Asp Pro Pro Leu 
2275 2280 2285 

Leu Thr Arg Tyr Leu Arg He His Pro Gin He Trp Glu His Gin He 
2290 2295 2300 

Ala Leu Arg Leu Glu He Leu Gly Cys Glu Ala Gin Gin Gin Tyr 
2305 2310 2315 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO , 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CCTTCCTTTA TCCAAATACG TAGATCAAGA GGAAATTGAC 4 0 
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(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GTAGCGTTGC CAAGAAGCAC CCTAAGACG 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
• (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GAAGAGTAGT ACGAGTTATT TCTCTGGGTT CAATGAC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
CCTTTATCCA AATACGTAGC GTTTGCCAAG AAG 33 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION: 1..19 

<D) OTHER INFORMATION: /note= "R is A or G and N is A, T, 

G or C . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
AARCAYCCNA ARACNTGGG 19 
(2) INFORMATION FOR SEQ ID. NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
GCTCGCACTA GGGGGTCTTG AATTC 2 5 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) 
(B) 
(C) 
<D) 



LENGTH: 44 base pairs 
TYPE: nucleic acid 
STRANDEDNESS.: both 
TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide primer, 
double-stranded from nucleotide 37-44, 3' end of short strand 
blocked with amino group." 

d;:' HYPOTHETICAL: NO 

(ivi ANTI- SENSE: NO 



(i>:) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION; 37.. 44 

(D) OTHER INFORMATION: /note= "Double stranded in the 
region from nucleotides 37-44, the 3' end is blocked with an 
amino group to reduce non-specific priming." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CTAATACGAC TCACTATAGG GCTCGAGCGG CCGCCCGGGC AGGT 44 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
CCATCCTAAT ACGACTCACT ATAGGGC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI - SENSE : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
CCATTGACAT GAAGAC CGTT TCTC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
ACTCACTATA GGGCTCGAGC GGC 23 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
GGGTGCAAAG CGCTGACATC AGTG 24 
(2) INFORMATION FOR SEQ ID NO: 18: . . 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CCTCTCGAGC CACCATGTCG AGCCACCATG CAGCTAGAGC TCTCCACCTG 50 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGCGCGGCCG CGCATCTGGC AAAGCTGAGT T 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 



(ix) FEATURE: 
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(A) NAME/KEY: misc_f eature 

(B) LOCATION: 25 . .27 

(D) OTHER INFORMATION: /note= "At position 25, R is A or 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GAAATAAGCC CAGGCTTTGC AGTCRAA 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 21. .22 

(D) OTHER INFORMATION: /note= "At position 22, N is A, G, 

C or T. " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
AGGAAATTCC ACTGGAACCT TN 22 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



( ix) FEATURE : 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 1..25 

(D) OTHER INFORMATION: /note= "At position 25, N is A, G, 

C or T. " . : _ ■ 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CTGGGGGTGA ATTCGAAGGT AGCGN 25 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GAGTTCATCG GGAAGACCTG TTG 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ACAGCCCATC AACTCCATGC GAAG 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
TCAGGGCAAT CAGGACTCC 19 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) / TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL : NO 
(iv) ANT I - SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CCGTGGTGAA CGCTCTGGAC C 21 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID. NO:27: 
GTAGAGGTCC TGTGCCTCGC AGCC 24 
(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 
<B) LOCATION: 1..27 

<D) OTHER INFORMATION : /note= "S is G or C, K is G or T, R 
is A or G, and Y is C or T." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTAGAGSTSC TGKGCCTCRC AKCCYAG 2 7 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CTTCGCATGG AGTTGATGGG CTGT 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AATCAGGACT CCTCCACCCC CG 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
GGATCCACCC CACGAGCTGG 2 0 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: . 
CGCCCTGAGG CTCGAGGTTC TAGG 24 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
AATCAGGACT CCTCCACCCC CG 22 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
CCTTGCAGGA ATTCGATTCA 1 20 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
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CCGTGGTGAA CGCTCTGGAC C 

(2) INFORMATION FOR SEQ ID NO: 36: 



(i) SEQUENCE CHARACTERISTICS : 

..(A) LENGTH: 6402 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Pig 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1. .6402 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



ATG CAG CTA GAG CTC TCC ACC TGT GTC TTT CTG TGT CTC TTG CCA CTC 4 8 

Met Gin Leu Glu Leu Ser Thr Cys Val Phe Leu Cys Leu Leu Pro Leu 
15 10 15 

GGC TTT AGT GCC ATC AGG AGA TAC TAC CTG .GGC GCA GTG GAA CTG TCC 96 
Gly Phe Ser Ala He Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser- 
20 25 30 

TGG GAC TAC CGG CAA AGT GAA CTC CTC CGT GAG CTG CAC GTG GAC ACC 144 
Trp Asp Tyr Arg Gin Ser Glu Leu Leu Arg Glu Leu His Val Asp Thr 
35 40 45 

AGA TTT CCT GCT ACA GCG CCA GGA GCT CTT CCG TTG GGC CCG TCA GTC 192 
Arg Phe Pro Ala Thr Ala Pro Gly Ala Leu Pro Leu Gly Pro Ser Val 
50 55 60 

CTG TAC AAA AAG ACT GTG TTC GTA GAG TTC ACG GAT CAA CTT TTC AGC 240 
Leu Tyr Lys- Lys Thr Val Phe Val Glu Phe Thr Asp Gin Leu Phe Ser 
65 70 75 80 

GTT GCC AGG CCC AGG CCA CCA TGG ATG GGT CTG CTG GGT CCT ACC ATC 2 88 

Val Ala Arg Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr He 

85 90 95 

CAG GCT GAG GTT TAC GAC ACG GTG GTC GTT ACC CTG AAG AAC ATG GCT 3 36 

Gin Ala Glu Val Tyr Asp Thr Val Val Val Thr Leu Lys Asn Met Ala 
100 105 no 

TCT CAT CCC GTT AGT CTT CAC GCT GTC GGC GTC TCC TTC TGG AAA TCT 3 84 

Ser His Pro Val Ser Leu His Ala Val Gly Val Ser Phe Trp Lys Ser 
115 120 125 



44 



BNSDOCID: <WO 9946274A1 J_> 



WO 99/46274 



PCT/US99/05193 



TCC GAA GGC GCT GAA TAT GAG GAT CAC ACC AGC CAA AGG GAG AAG GAA 4 32 

Ser Glu Gly Ala Glu Tyr Glu Asp His Thr Ser Gin Arg Glu Lys Glu 
130 135 140 

GAC GAT AAA GTC CTT CCC GGT AAA AGC CAA ACC TAC GTC TGG CAG GTC 4 80 

Asp Asp Lys Val Leu Pro Gly Lys Ser Gin Thr Tyr Val Trp Gin Val 
145 150 155 160 

CTG AAA GAA AAT GGT CCA ACA GCC TCT GAC CCA CCA TGT CTC ACC TAC 528 
Leu Lys Glu Asn Gly Pro Thr Ala Ser Asp Pro Pro Cys Leu Thr Tyr 
165 170 175 

TCA TAC CTG TCT CAC GTG GAC CTG GTG AAA GAC CTG AAT TCG GGC CTC 5 76 

Ser Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu 
180 185 190 

ATT GGA GCC CTG CTG GTT TGT AGA GAA GGG AGT CTG ACC AGA GAA AGG 6 24 

lie Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Thr Arg Glu Arg 
195 200 205 

ACC CAG AAC CTG CAC GAA TTT GTA CTA CTT TTT GCT GTC TTT GAT GAA 6 72 

Thr Gin Asn Leu His Glu Phe Val Leu Leu Phe Ala Val Phe Asp Glu 
210 215 220 

GGG AAA AGT TGG CAC TCA GCA AGA AAT GAC TCC TGG ACA CGG GCC ATG 720 
Gly Lys Ser Trp His Ser Ala Arg Asn Asp Ser Trp Thr Arg Ala Met 
225 230 235 240 

GAT CCC GCA CCT GCC AGG GCC CAG CCT GCA ATG CAC ACA GTC AAT GGC 76 8 

Asp Pro Ala Pro Ala Arg Ala Gin Pro Ala Met His Thr Val Asn Gly 
245 250 255 

TAT GTC AAC AGG TCT CTG CCA GGT CTG ATC GGA TGT CAT AAG AAA TCA -<816 
Tyr Val Asn Arg Ser Leu Pro Gly Leu lie Gly Cys His Lys Lys Ser 
260 265 270 

GTC TAC TGG CAC GTG ATT GGA ATG GGC ACC AGC CCG GAA GTG CAC TCC **8 64 

Val Tyr Trp His Val lie Gly Met Gly Thr Ser Pro Glu Val His Ser 
275 280 285 

ATT TTT CTT GAA GGC CAC ACG TTT CTC GTG AGG CAC CAT CGC CAG GCT 912 
lie Phe Leu Glu Gly His Thr Phe Leu Val Arg His His Arg Gin Ala 
290 295 300 

TCC TTG GAG ATC TCG CCA CTA ACT TTC CTC ACT GCT CAG ACA TTC CTG 960 
Ser Leu Glu lie Ser Pro Leu Thr Phe Leu Thr Ala Gin Thr Phe Leu 
305 310 315 320 

ATG GAC CTT GGC CAG TTC CTA CTG TTT TGT CAT ATC TCT TCC CAC CAC 10 08 

Met Asp Leu Gly Gin Phe Leu Leu Phe Cys His lie Ser Ser His His 
325 330 335 

CAT GGT GGC ATG GAG GCT CAC GTC AGA GTA GAA AGC TGC GCC GAG GAG 1056 
His Gly Gly Met Glu Ala His Val Arg Val Glu Ser Cys Ala Glu Glu 
340 345 350 
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CCC CAG CTG CGG AGG AAA' GCT* GAT GAA GAG GAA GAT TAT GAT GAC AAT 1104 
Pro Gin Leu Arg Arg Lys Ala Asp Glu Glu Glu Asp Tyr Asp Asp Asn 
355 360 365 

TTG TAC GAC TCG GAC ATG GAC; GTG GTC CGG CTC GAT GGT GAC GAC GTG 1152 
Leu Tyr Asp Ser Asp Met Asp Val Val Arg Leu Asp Gly Asp Asp Val 
370 375 380 

TCT CCC TTT ATC CAA ATC CGC TCG GTT GCC AAG AAG CAT CCC AAA ACC 1200 
Ser Pro Phe lie Gin lie Arg Ser Val Ala Lys Lys His Pro Lys Thr 
385 390 395 4 00 

TGG GTG CAC: TAC ATC TCT GCA GAG GAG GAG GAC TGG GAC TAC GCC CCC 1248 
Trp Val His Tyr He Ser Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro 
405 410 415 

GCG GTC CCC AGC CCC AGT GAC AGA AGT TAT AAA AGT CTC TAC TTG AAC 1236 
Ala Val Pro Ser Pro Ser Asp Arg Ser Tyr Lys Ser Leu Tyr Leu Asn 
420 425 430 

AGT GGT CCT CAG CGA ATT GGT AGG AAA TAC AAA AAA GCT CGA TTC GTC 1344 
Ser Gly Pro Gin Arg He Gly Arg Lys Tyr Lys Lys Ala Arg Phe Val 
435 440 445 

GCT TAC ACG GAT GTA ACA TTT AAG ACT CGT AAA GCT ATT CCG TAT GAA 1392 
Ala Tyr Thr Asp Val Thr Phe Lys Thr Arg Lys Ala He Pro Tyr Glu 
450 455 460 

TCA GGA ATC CTG GGA CCT TTA CTT TAT GGA GAA GTT GGA GAC ACA CTT 1440 
Ser Gly He Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu, 
465 470 475 480 

TTG ATT ATA TTT AAG AAT AAA GCG AGC CGA CCA TAT AAC ATC TAC CCT 1488 
Leu He He Phe Lys Asn Lys Ala Ser Arg Pro Tyr Asn lie Tyr Pro 
485 490 495 

CAT GGA- ATC ACT GAT GTC AGC GCT TTG CAC CCA GGG AGA CTT CTA AAA 1536 
His Gly He Thr Asp Val Ser Ala Leu His Pro Gly Arg Leu Leu Lys 
500 505 510 

GGT TGG AAA CAT TTG AAA GAC ATG; CCA ATT CTG CCA GGA GAG ACT TTC 1584 
Gly Trp' Lys His Leu Lys Asp Met Pro He Leu Pro Gly Glu Thr Phe 
515 520 525 

AAG TAT AAA TGG ACA GTG ACT GTG GAA GAT GGG CCA ACC AAG TCC GAT 1632 
Lys Tyr Lys - Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp 
530 535 540 

CCT CGG TGC CTG ACC CGC TAC TAC TCG AGC.. TCC ATT AAT CTA GAG AAA 163 0 

Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Ser He Asn Leu Glu Lys 
545 550 555 560 

GAT CTG GCT TCG GGA CTC . ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA 172 8 

Asp Leu Ala Ser Gly Leu He 'Gly Pro Leu Leu He Cys Tyr Lys Glu 
565 570 ^ - 575 
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TCT GTA GAC CAA AGA GGA AAC CAG ATG ATG TCA GAC AAG AGA AAC GTC 1776 
Ser Val Asp Gin Arg Gly Asn Gin Met Met. Ser Asp Lys Arg Asn Val 
580 585 590 

ATC CTG TTT TCT GTA TTC GAT GAG AAT CAA AGC TGG TAC CTC GCA GAG 1824 
lie Leu Phe Ser Val Phe Asp Glu Asn Gin Ser Trp Tyr Leu Ala Glu 
595 600 605 

AAT ATT CAG CGC TTC CTC CCC AAT CCG GAT GGA TTA CAG CCC CAG GAT 1872 
Asn He Gin Arg Phe Leu Pro Asn Pro Asp Gly Leu Gin Pro Gin Asp 
610 615 620 

CCA GAG TTC CAA GCT TCT AAC ATC ATG - CAC AGC ATC AAT GGC TAT GTT 192 0 

Pro Glu Phe Gin Ala Ser Asn lie Met His Ser lie Asn Gly Tyr Val 
625 630 635 640 

TTT GAT AGC TTG CAG CTG TCG GTT TGT TTG CAC GAG GTG GCA TAC TGG 1968 
Phe Asp Ser Leu Gin Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp 
645 650 655 

TAC ATT CTA AGT GTT GGA GCA CAG ACG GAC TTC CTC TCC GTC TTC TTC 2 016 

Tyr He Leu Ser Val Gly Ala Gin Thr Asp Phe Leu Ser Val Phe Phe 
660 665 670 

TCT GGC TAC ACC TTC AAA CAC AAA ATG GTC TAT GAA GAC ACA CTC ACC 2064 
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr 
675 680 685 

CTG TTC CCC TTC TCA GGA GAA ACG GTC TTC ATG TCA ATG GAA AAC CCA 2112 
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro 
690 695 700 

GGT CTC TGG GTC CTA GGG TGC CAC AAC TCA GAC TTG CGG AAC AGA GGG 216 0 

Gly Leu Trp Val Leu Gly Cys His Asn Ser Asp Leu Arg Asn Arg Gly 
705 710 715 720 

ATG ACA GCC TTA CTG' AAG GTG TAT AGT TGT GAC AGG GAC ATT GGT GAT 2208 
Met Thr Ala Leu Leu Lys Val Tyr Ser Cys Asp Arg Asp He Gly Asp 
725 730 735 

TAT TAT GAC AAC ACT TAT GAA GAT ATT CCA GGC TTC TTG CTG AGT GGA 22 56 

Tyr Tyr Asp Asn Thr Tyr Glu Asp He Pro Gly Phe Leu Leu Ser Gly 
740 745 750 

AAG AAT GTC ATT GAA CCC AGA AGC TTT GCC CAG AAT TCA AGA CCC CCT 2304 
Lys Asn Val He Glu Pro Arg Ser Phe Ala Gin Asn Ser Arg Pro Pro 
755 760 765 

AGT GCG-AGC CAA AAG CAA TTC CAA ACC ATC ACA AGT CCA GAA GAT GAC 2 3 52 

Ser Ala Ser Gin Lys Gin Phe Gin Thr lie Thr Ser Pro Glu Asp Asp 
770 775 780 

GTG GAG CTT GAC CCG CAG TCT GGA GAG AGA ACC CAA GCA CTG GAA GAA 2400 
Val Glu Leu Asp Pro Gin Ser Gly Glu Arg Thr Gin Ala Leu Glu Glu 
785 790 795 800 
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CTA AGT GTCCCC TCT GGT GAT GGG TCG ATG CTC TTG GGA CAG AAT CCT 2448 
Leu Ser Val Pro Ser Gly Asp Gly Ser Met Leu Leu Gly Gin Asn Pro 
805 810 815 

GCT CCA CAT GGC TCA TCC TCA TCT GAT CTT CAA GAA GCC AGG AAT GAG 24 96 

Ala Pro His Gly Ser Ser Ser Ser Asp Leu Gin Glu Ala Arg Asn Glu 
820 825 830 

GCT GAT GAT TAT TTA CCT GGA GCA AGA GAA AGA AAC ACG GCC CCA TCC 2 544 

Ala Asp Asp Tyr Leu Pro Gly Ala Arg Glu Arg Asn Thr Ala Pro Ser 
835 840 845 

GCA GCG GCA CGT CTC AGA CCA GAG CTG CAT CAC AGT GCC GAA AGA GTA 2 5 92 

Ala Ala Ala Arg Leu Arg Pro Glu Leu His His Ser Ala Glu Arg Val 
850 855 * 860 

CTT ACT CCT GAG CCA GAG AAA GAG TTG AAG AAA CTT GAT TCA AAA ATG 2 64 0 

Leu Thr Pro Glu Pro Glu Lys Glu Leu Lys Lys Leu Asp Ser Lys Met 
865 870 875 880 

TCT AGT TCA TCA GAC CTT CTA AAG ACT TCG CCA ACA ATT CCA TCA GAC 268 8 

Ser Ser Ser Ser Asp Leu Leu Lys Thr Ser Pro Thr lie Pro Ser Asp 
885 " 890 895 

ACG TTG TCA GCG GAG ACT GAA AGG ACA CAT TCC TTA GGC CCC CCA CAC 273 6 

Thr Leu Ser Ala Glu Thr Glu Arg Thr His Ser Leu Gly Pro Pro His 
900 905 910 

CCG CAG GTT AAT TTC AGG AGT CAA TTA GGT GCC ATT GTA CTT GGC AAA 2784 
Pro Gin Val Asn Phe Arg Ser Gin Leu Gly Ala lie Val Leu Gly Lys- 
915 920 925 

AAT TCA' TCT CAC TTT ATT GGG GCT GGT GTC CCT * TTG GGC TCG ACT GAG 2 832 

Asn Ser Ser His -Phe lie Gly Ala Gly Val Pro Leu Gly Ser Thr Glu 
930 935 940 

GAG GAT CAT GAA AGC TCC CTG GGA GAA AAT GTA TCA CCA GTG GAG AGT 2880 
Glu Asp His Glu Ser Ser Leu Gly Glu Asn Val Ser Pro Val Glu Ser 
945 -950 955 960 

GAC GGG ATA TTT GAA AAG GAA AGA GCT CAT GGA CCT GCT TCA CTG ACC 292 8 

Asp Gly lie Phe Glu Lys Glu Arg Ala His Gly Pro Ala Ser Leu Thr 
965 970 975 

AAA GAC GAT GTT TTA TTT AAA GTT AAT ATC TCT TTG GTA AAG ACA AAC 2 976 

Lys Asp Asp Val Leu Phe Lys Val Asn lie Ser Leu Val Lys Thr Asn 
980 985 990 

AAG GCA CGA GTT TAC TTA AAA ACT AAT AGA AAG ATT CAC ATT GAT GAC 3 024 

Lys Ala Arg Val Tyr Leu Lys Thr Asn Arg Lys lie His lie Asp Asp 
995 1000 1005 

GCA GCT TTA TTA ACT GAG AAT AGG GCA TCT GCA ACG TTT ATG GAC AAA . 3 072 
Ala Ala Leu Leu Thr Glu Asn Arg Ala Ser Ala Thr Phe Met Asp Lys 
1010 1015 1020 
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AAT ACT ACA GCT TCG GGA TTA AAT CAT GTG TCA AAT TGG ATA AAA GGG 312 0 

Asn Thr Thr Ala Ser Gly Leu Asn His Val Ser Asn Trp lie Lys Gly 
1025 1030 1035 1040 

CCC CTT GGC AAG AAC CCC CTA AGC TCG GAG CGA GGC CCC AGT CCA GAG 316 8 

Pro Leu Gly Lys Asn Pro Leu Ser Ser Glu Arg Gly Pro Ser Pro Glu 
1045 1050 1055 

CTT CTG ACA TCT TCA GGA TCA GGA AAA TCT GTG AAA GGT CAG AGT TCT 3216 
Leu Leu Thr Ser Ser Gly Ser Gly Lys Ser Val Lys Gly Gin Ser Ser 
1060 1065 1070 

GGG CAG GGG AGA ATA CGG GTG GCA GTG GAA GAG GAA GAA CTG AGC AAA 32 64 

Gly Gin Gly Arg lie Arg Val Ala Val Glu Glu Glu Glu Leu Ser Lys 
1075 1080 1085 

GGC AAA GAG ATG ATG CTT . CCC AAC AGC GAG CTC ACC TTT CTC ACT AAC 3312 
Gly Lys Glu Met Met Leu Pro Asn Ser Glu Leu Thr Phe Leu Thr Asn 
1090 1095 1100 

TCG GCT GAT GTC CAA GGA AAC GAT ACA CAC AGT CAA GGA AAA AAG TCT 33 60 

Ser Ala Asp Val Gin Gly Asn Asp Thr His Ser Gin Gly Lys Lys Ser 
1105 1110 1115 1120 

CGG GAA GAG ATG GAA AGG AGA GAA AAA TTA GTC CAA GAA AAA GTC GAC 3408 
Arg Glu Glu Met Glu Arg Arg Glu Lys Leu Val Gin Glu Lys Val Asp 
1125 1130 1135 

TTG CCT CAG GTG TAT ACA GCG ACT GGA ACT AAG AAT TTC CTG AGA AAC 3456 
Leu Pro Gin Val Tyr Thr Ala Thr Gly Thr Lys Asn Phe Leu Arg Asn 
1140 1145 1150 

ATT TTT CAC CAA AGC ACT GAG CCC AGT GTA GAA GGG TTT GAT GGG GGG 3 504 

lie Phe His Gin Ser Thr Glu Pro Ser Val Glu Gly Phe Asp Gly Gly 
1155 1160 1165 

TCA CAT GCG CCG GTG CCT CAA GAC AGC AGG TCA TTA AAT GAT TCG GCA 3 552 

Ser His Ala Pro Val Pro Gin Asp Ser Arg Ser Leu Asn Asp Ser Ala 
1170 1175 1180 

GAG AGA GCA GAG ACT CAC ATA GCC CAT TTC TCA GCA ATT AGG GAA GAG 3600 
Glu Arg Ala Glu Thr His lie Ala His Phe Ser Ala lie Arg Glu Glu 
1185 1190 1195 1200 

GCA CCC TTG GAA GCC CCG GGA AAT CGA ACA GGT CCA GGT CCG AGG AGT 3648 
Ala Pro Leu Glu Ala Pro Gly Asn Arg Thr Gly Pro Gly Pro Arg Ser 
1205 1210 1215 

GCG GTT CCC CGC CGC GTT AAG CAG AGC TTG AAA CAG ATC AGA CTC CCG 3696 
Ala Val Pro Arg Arg Val Lys Gin Ser Leu Lys Gin lie Arg Leu Pro 
1220 1225 1230 

CTA GAA GAA ATA AAG CCT GAA AGG GGG GTG GTT CTG AAT GCC ACC TCA 3 744 

Leu Glu Glu lie Lys Pro Glu Arg Gly Val Val Leu Asn Ala Thr Ser 
1235 1240 1245 
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ACC CGG TGG TCT GAA AGC AGT CCT ATC TTA CAA GGA GCC AAA AGA AAT 3 792 

Thr Arg Trp Ser Glu Ser Ser Pro He Leu Gin Gly Ala Lys . Arg Asn 
1250 1255 1260 

AAC CTT TCT TTA CCT TTC CTG ACC TTG GAA ATG GCC GGA GGT CAA GGA 384 0 

Asn Leu Ser Leu Pro Phe Leu Thr Leu Glu Met Ala Gly Gly Gin Gly 
1265 1270 1275 i 2 80 

AAG ATC AGC GCC CTG GGG AAA AGT GCC GCA GGC CCG CTG GCG TCC GGG 388 8 

Lyc lie Ser Ala Leu Gly Lys Ser Ala Ala Gly Pro Leu Ala Ser Gly 
1285 1290 1295 

AAG CTG GAG AAG GCT GTT CTC TCT TCA GCA GGC TTG TCT- GAA GCA TCT 3 93 6 

Lys Leu Glu . Lys Ala Val Leu Ser Ser Ala Gly Leu Ser Glu Ala Ser 
1300 1305 1310 

GGC AAA GCT GAG TTT CTT CCT AAA GTT CGA GTT CAT CGG GAA GAC CTG 3984 
Gly Lys Ala Glu Phe Leu Pro Lys Val Arg Val His Arg Glu Asp Leu 
1315 1320 1325 

TTG CCT CAA AAA ACC AGC AAT GTT TCT TGC GCA CAC GGG GAT CTC GGC 4032 
Leu Pro Gin Lys Thr Ser Asn Val Ser Cys Ala His Gly Asp Leu Gly 
1330 1335 1340 

CAG GAG ATC TTC CTG CAG AAA ACA CGG GGA CCT GTT AAC CTG AAC AAA 4080 
Gin Glu. He Phe Leu Gin Lys Thr Arg Gly Pro Val Asn Leu Asn Lys 
X345 1350 1355 1360 

GTA AAT AGA CCT GGA AGG ACT CCC TCC AAG CTT CTG GGT CCC CCG ATG 4128 
Val Asn Arg Pro Gly Arg Thr Pro Ser Lys Leu Leu Gly Pro Pro Met 
1365 1370 1375 

CCC AAA GAG TGG GAA TCC CTA GAG AAG TCA CCA AAA AGC ACA GCT CTC 4176 
Pro Lys Glu Trp Glu Ser Leu Glu Lys Ser Pro Lys Ser Thr Ala Leu 
1380 1385 1390 

AGG ACG AAA GAC ATC ATC AGT TTA CCC CTG GAC CGT CAC GAA AGC AAT 4224 
Arg Thr Lys Asp He He Ser Leu Pro Leu Asp Arg His Glu Ser Asn 
1395 1400 1405 

CAT TCA ATA GCA GCA AAA AAT GAA GGA CAA GCC GAG ACC CAA AGA GAA 42 72 

His Ser He Ala Ala Lys Asn Glu Gly Gin Ala Glu Thr Gin Arg Glu 
1410 1415 1420 

GCC GCC TGG ACG AAG CAG GGA GGG CCT GGA AGG CTG TGC GCT CCA AAG 4320 
Ala Ala Trp Thr Lys Gin Gly Gly Pro Gly Arg Leu Cys Ala Pro Lys 
1425 1430 1435 1440 

CCT CCG GTC CTG CGA CGG CAT CAG AGG GAC ATA AGC CTT CCT ACT TTT 4368 
Pro Pro Val Leu Arg Arg His Gin Arg Asp He Ser Leu Pro Thr Phe 
1445 1450 1455 

CAG CCG GAG GAA GAC AAA ATG GAC TAT GAT GAT ATC TTC TCA ACT GAA 4416 
Gin Pro Glu Glu Asp Lys Met Asp Tyr Asp Asp He Phe Ser Thr Glu 
1460 1465 1470 
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ACG AAG GGA GAA GAT TTT GAC ATT TAC GGT GAG GAT GAA AAT CAG GAC 4464 
Thr Lys Gly Glu Asp Phe Asp He Tyr Gly Glu Asp Glu Asn Gin Asp 
1475 1480 1485 

CCT CGC AGC TTT CAG AAG AGA ACC CGA CAC TAT TTC ATT GCT GCG GTG 4 512 

Pro Arg Ser Phe Gin Lys Arg Thr Arg His Tyr Phe He Ala Ala Val 
1490 1495 1500 

GAG CAG CTC TGG GAT TAC GGG ATG AGC GAA TCC CCC CGG GCG CTA AGA 4560 
Glu Gin Leu Trp Asp Tyr Gly Met Ser Glu Ser Pro Arg Ala Leu Arg 
1505 1510 1515 1520 

AAC AGG GCT CAG AAC GGA GAG GTG CCT CGG TTC AAG AAG GTG GTC TTC 4 6 08 

Asn Arg Ala Gin Asn Gly Glu Val Pro Arg Phe Lys Lys Val Val Phe 
1525 1530 1535 

CGG GAA TTT GCT GAC GGC TCC TTC ACG CAG CCG TCG TAC CGC GGG GAA 4656 
Arg Glu Phe Ala Asp Gly Ser Phe Thr Gin Pro Ser Tyr Arg Gly Glu 
1540 1545 1550 

CTC AAC AAA CAC TTG GGG CTC TTG GGA CCC TAC ATC AGA GCG GAA GTT 4 704 

Leu Asn Lys His Leu Gly Leu Leu Gly Pro Tyr He Arg Ala Glu Val 
1555 1560 1565 

GAA GAC AAC ATC ATG GTA ACT TTC AAA AAC CAG GCG TCT CGT CCC TAT 4 752 

Glu Asp Asn lie Met Val Thr Phe Lys Asn Gin Ala Ser Arg Pro Tyr 
1570 1575 1580 

TCC TTC TAC TCG AGC CTT ATT TCT TAT CCG GAT GAT CAG GAG CAA GGG 4800 
Ser Phe Tyr Ser Ser Leu He Ser Tyr Pro Asp Asp Gin Glu Gin Gly 
1585 1590 1595 1600 

GCA GAA CCT CGA CAC AAC TTC GTC CAG CCA AAT GAA ACC AGA ACT TAC 4848 
Ala Glu Pro Arg His Asn Phe Val Gin Pro Asn Glu Thr Arg Thr Tyr 
1605 1610 1615 

TTT TGG AAA GTG CAG CAT CAC ATG GCA CCC ACA GAA GAC GAG TTT GAC -4 8 96 

Phe Trp Lys Val Gin His His Met Ala Pro Thr Glu Asp Glu Phe Asp 
1620 1625 1630 

TGC AAA GCC TGG GCC TAC TTT TCT GAT GTT GAC CTG GAA AAA GAT GTG 4 944 

Cys Lys Ala Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val 
1635 1640 1645 

CAC TCA GGC TTG ATC GGC CCC CTT CTG ATC TGC CGC GCC AAC ACC CTG 4992 
His Ser Gly Leu He Gly Pro Leu Leu He Cys Arg Ala Asn Thr Leu 
1650 1655 1660 

AAC GCT GCT CAC GGT AGA CAA GTG ACC GTG CAA GAA TTT GCT CTG TTT * 504 0 

Asn Ala Ala His Gly Arg Gin Val Thr Val Gin Glu Phe Ala Leu Phe 
1665 1670 1675 1680 

TTC ACT ATT TTT GAT GAG ACA AAG AGC TGG TAC TTC ACT GAA AAT GTG 508 8 

Phe Thr He Phe Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu- Asn Val 
1685 1690 1695 
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GAA AGG AAC TGC CGG GCC CCC TGC CAC CTG CAG ATG GAG GAC ; CCC ACT 513 6 

Glu Arg Asn Cys. Arg Ala Pro Cys His Leu Gin Met Glu Asp Pro Thr 
1700 1705 1710 

CTG AAA GAA AAC TAT CGC TTC CAT GCA ATC AAT GGC TAT GTG ATG GAT 5184 
Leu Lys Glu Asn Tyr Arg. Phe His Ala lie Asn Gly Tyr Val Met Asp 
1715 1720 . 1725 : 

ACA CTC CCT GGC TTA GTA ATG GCT CAG AAT CAA AGG ATC CGA TGG TAT 5232 
Thr Leu Pro Gly Leu Val Met Ala Gin Asn Gin Arg lie Arg Trp Tyr 
1730 1735 1740 

CTG CTC AGC ATG GGC AGC AAT GAA AAT ATC CAT TCG ATT CAT TTT AGC 52 8 0 

Leu Leu Ser Met Gly Ser Asn Glu Asn He His Ser He His Phe Ser 
1745 1750 1755 1760 

GGA CAC GTG TTC AGT GTA CGG AAA AAG GAG GAG TAT AAA ATG GCC GTG 53 2 8 

Gly His Val Phe Ser Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Val 
1765 1770 1775 

TAG AAT CTC TAT CCG GGT GTC TTT GAG ACA GTG GAA ATG CTA CCG TCC 53 76 

Tyr Asn Leu Tyr Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser 
1780 1785 . 1790 

AAA GTT GGA ATT TGG CGA ATA GAA TGC CTG ATT GGC GAG CAC CTG CAA 5424 
Lys Val Gly lie Trp Arg He Glu Cys Leu He Gly Glu His Leu Gin 
1795 1800 1805 

GCT GGG ATG AGCACG ACT TTC CTG GTG TAC AGC AAG GAG TGT CAG GCT 54 72 

Ala Gly Met Ser Thr Thr Phe Leu Val Tyr Ser Lys Glu Cys Gin Ala. 
1810 1815 1820 

CCA CTG GGA ATG GCT TCT GGA CGC ATT AGA GAT TTT CAG ATC ACA GCT 5520 
Pro Leu Gly Met Ala Ser Gly Arg He Arg. Asp. Phe r Gin He Thr Ala 
1825 1830 1835 1840 

TCA GGA CAG TAT GGA CAG TGG GCC CCA AAG CTG GCC AGA CTT CAT TAT 5568 
Ser Gly Gin Tyr. Gly Gin Trp Ala Pro Lys Leu Ala Arg Leu His Tyr 
1845 1850 1855 

TCC GGA TCA. ATC AAT GCC TGG AGC ACC AAG GAT CCC CAC TCC TGG ATC 5616 
Ser Gly Ser He Asn' Ala Trp Ser Thr Lys Asp Pro His Ser Trp lie 
1860 1865 1870 

AAG GTG GAT CTG TTG GCA CCA ATG ATC ATT CAC GGC ATC ATG ACC CAG 5664 
Lys Val. Asp Leu Leu Ala Pro Met lie lie His Gly He Met Thr Gin 
1875 1880^ 1885 

GGT GCC CGT CAG AAG TTT: TCC AGC CTC . TAC ATC TCC CAG TTT ATC ATC- 5712 
Gly Ala Arg Gin Lys Phe Ser Ser Leu Tyr He Ser Gin Phe He He 
1890 1895 . 1900 

ATG TAC AGT CTT GAC GGG AGG AAC TGG CAG AGT TAC -CGA GGG AAT TCC - 5 760 

Met Tyr Ser Leu Asp Gly Arg Asn Trp Gin Ser Tyr Arg Gly Asn Ser 
1905 1910 1915 1920 
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ACG GGC ACC TTA ATG GTC TTC TTT GGC AAT GTG GAC GCA TCT GGG ATT 5808 
Thr Gly Thr Leu Met Val Phe Phe Gly Asn Val Asp Ala Ser Gly lie 
1925 1930 1935 

AAA CAC AAT ATT TTT AAC CCT CCG ATT GTG GCT CGG TAC ATC CGT TTG 5856 
Lys His Asn lie Phe Asn Pro Pro lie Val Ala Arg Tyr lie Arg Leu 
1940 1945 1950 

CAC CCA ACA CAT TAC AGC ATC CGC AGC ACT CTT CGC ATG GAG TTG ATG 5904 
His Pro Thr His Tyr Ser lie Arg Ser Thr Leu Arg Met Glu Leu Met 
1955 1960 1965 

GGC TGT GAT TTA AAC AGT TGC AGC ATG CCC CTG GGA ATG CAG AAT AAA 5 952 

Gly Cys Asp Leu Asn Ser Cys Ser Met Pro Leu Gly Met Gin Asn Lys 
1970 1975 1980 

GCG ATA TCA GAC TCA CAG ATC ACG GCC TCC TCC CAC CTA AGC AAT ATA 6000 
Ala lie Ser Asp Ser Gin lie Thr Ala Ser Ser His Leu Ser Asn lie 
1985 1990 1995 2000 

TTT GCC ACC TGG TCT CCT TCA CAA GCC CGA CTT CAC CTC CAG GGG CGG 6048 
Phe Ala Thr Trp Ser Pro Ser Gin Ala Arg Leu His Leu Gin Gly Arg 
2005 2010 2015 

ACG AAT GCC TGG CGA CCC CGG GTG AGC AGC GCA GAG GAG TGG CTG CAG 6096 
Thr Asn Ala Trp Arg Pro Arg Val Ser Ser Ala Glu Glu Trp Leu Gin 
2020 2025 2030 

GTG GAC CTG CAG AAG ACG GTG AAG GTC ACA GGC ATC ACC ACC CAG GGC 6144 
Val Asp Leu Gin Lys Thr Val Lys Val Thr Gly lie Thr Thr Gin Gly 
2035 2040 2045 

GTG AAG TCC CTG CTC AGC AGC ATG TAT GTG AAG GAG TTC CTC GTG TCC ^6192 
Val Lys Ser Leu Leu Ser Ser Met Tyr Val Lys Glu Phe Leu Val Ser 
2050 2055 2060 

AGT AGT CAG GAC GGC CGC CGC TGG ACC CTG TTT CTT CAG GAC GGC CAC = 6240 

Ser Ser Gin Asp Gly Arg Arg Trp Thr Leu Phe Leu Gin Asp Gly His 
2065 2070 2075 2080 

ACG AAG GTT TTT CAG GGC AAT CAG GAC TCC TCC ACC CCC GTG GTG AAC 6288 
Thr Lys Val Phe Gin Gly Asn Gin Asp Ser Ser Thr Pro Val Val Asn 
2085 2090 2095 

GCT CTG GAC CCC CCG CTG TTC ACG CGC TAC CTG AGG ATC CAC CCC ACG 6336 
Ala Leu Asp Pro Pro Leu Phe Thr Arg Tyr Leu Arg lie His Pro Thr 
2100 2105 2110 

AGC TGG GCG CAG CAC ATC GCC CTG AGG CTC GAG GTT CTA GGA TGT GAG 63 84 

Ser Trp Ala Gin His He Ala Leu Arg Leu Glu Val Leu Gly Cys Glu 
2115 2120 2125 

GCA CAG GAT CTC TAC TGA 6402 
Ala Gin Asp Leu Tyr * 
2130 



53 



BNSDOCID: <WO. 



.99462 7 A*: J_> 



WO 99/46274 ^ ^ PCT/yS99/05193 

(2) INFORMATION FOR SEQ ID NO:37: : - . . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2134 amino acids : 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

. (xi ), SEQUENCE DESCRIPTION : SEQ ID NO:37: 

Met Gin Leu Glu Leu Ser Thr Cys Val Phe Leu Cys Leu Leu Pro Leu 

1 • . .; . 5 . 10 , 15 

Gly Phe Ser Ala lie Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser 
20 25 30 

Trp Asp Tyr Arg Gin Ser Glu Leu Leu Arg Glu Leu His Val Asp Thr 
35 40 45 



Arg Phe Pro Ala Thr Ala Pro Gly Ala Leu Pro Leu Gly Pro Ser Val 

50 . .* . 55 - : 60 . : 

Leu Tyr Lys Lys Thr Val Phe Val Glu Phe Thr Asp Gin Leu Phe Ser 
65 70 75 80 

Val Ala Arg Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr lie 
85 90 95 

Gin Ala Glu Val Tyr Asp Thr Val Val Val Thr Leu Lys Asn Met Ala 
100 105 110 

Ser His Pro Val Ser Leu His Ala Val Gly Val Ser Phe Trp Lys Ser 
115 120 125 

Ser Glu Gly Ala Glu Tyr Glu Asp His Thr Ser Gin Arg Glu Lys Glu 
130 135 140 

Asp Asp Lys Val Leu Pro Gly Lys Ser Gin Thr Tyr Val Trp Gin Val 
145 , " 150 . 155 : 160 

Leu Lys Glu Asn Gly Pro Thr Ala Ser Asp Pro Pro Cys Leu Thr Tyr 
165 170 175 

Ser Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu 
180 . 185 190 

lie Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Thr Arg Glu Arg 
- ■195 ,;. ; - v ":. - 200 , 205 , ; , , 

Thr Gin Asn Leu His Glu Phe Val Leu Leu Phe Ala Val Phe Asp Glu 
210 . 215 .220 

Gly Lys Ser Trp His Ser Ala Arg Asn Asp Ser Trp Thr Arg Ala Met 

225 • . .. 230 - • 235 : 240 
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Asp Pro Ala Pro Ala Arg Ala Gin Pro Ala Met His Tlir Val Asn Gly 
245 250 255 

Tyr Val Asn Arg Ser Leu Pro Gly Leu lie Gly Cys His Lys Lys Ser 
260 265 270 

Val Tyr Trp His Val lie Gly Met Gly Thr Ser Pro Glu Val His Ser 
275 280 285 

lie Phe Leu Glu Gly His Thr Phe Leu Val Arg His His Arg Gin Ala 
290 295 300 

Ser Leu Glu lie Ser Pro Leu Thr Phe Leu Thr Ala Gin Thr Phe Leu 
305 310 315 320 

Met Asp Leu Gly Gin Phe Leu Leu Phe Cys His lie Ser Ser His His 
325 330 335 

His Gly Gly Met Glu Ala His Val Arg Val Glu Ser Cys Ala Glu Glu 
340 345 350 

Pro Gin Leu Arg Arg Lys Ala Asp Glu Glu Glu Asp Tyr Asp Asp Asn 
355 360 365 

Leu Tyr Asp Ser Asp Met Asp Val Val Arg Leu Asp Gly Asp Asp Vai 
370 375 380 

Ser Pro Phe lie Gin lie Arg Ser Val Ala Lys Lys His Pro Lys Thr 
385 390 395 400 

Trp Val His Tyr lie Ser Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro 
405 410 415 

Ala Val Pro Ser Pro Ser Asp Arg Ser Tyr Lys Ser Leu Tyr Leu Asn 
420 425 430 

Ser Gly Pro Gin Arg lie Gly Arg Lys Tyr Lys Lys Ala Arg Phe Val 
435 440 445 

Ala Tyr Thr Asp Val Thr Phe Lys Thr Arg Lys Ala He Pro Tyr Glu 
450 455 460 

Ser Gly lie Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu 
465 470 475 480 

Leu He lie Phe Lys Asn Lys Ala Ser Arg Pro Tyr Asn He Tyr Pro 
485 490 495 

His Gly lie Thr Asp Val Ser Ala Leu His Pro Gly Arg Leu Leu' Lys 
500 505 510 

Gly Trp Lys His Leu Lys Asp Met Pro He Leu Pro Gly Glu Thr Phe 
515 520 525 

Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp 
530 535 540 
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Pro Arg Cys Leu Thr Arg Tyr Tyr 
545 550 

Asp Leu Ala Ser Gly. Leu lie Gly 
565 

Ser Val Asp Gin Arg Gly Asn Gin 
580 

lie Leu Phe Ser Val Phe Asp Glu 
595 600 

Asn lie Gin Arg Phe Leu Pro Asn 
610 615 



Ser- Ser Ser lie Asn Leu Glu Lys 
555 560 

Pro Leu Leu lie Cys Tyr Lys Glu 
570 575 

Met Met Ser Asp Lys Arg Asn Val 
585 590 

Asn Gin Ser Trp Tyr Leu Ala Glu 
605 

Pro Asp Gly Leu Gin . Pro Gin Asp 
620 



Pro Glu Phe Gin Ala 
625 

Phe Asp Ser Leu Gin 
645 

Tyr lie -Leu Ser Val 
660 



Ser Asn lie Met His Ser 
630 635 

Leu Ser Val Cys Leu His 
650 

Gly Ala Gln ; Thr Asp Phe 
665 



lie Asn Gly Tyr Val 
640 

Glu Val Ala Tyr Trp 
655 

Leu Ser Val Phe Phe . 
670 



Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr 
675 680 685 

Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro 
690 695 700 

Gly Leu Trp Val Leu Gly Cys His Asn Ser Asp Leu Arg Asn Arg Gly 
705 710 715 720 

Met Thr Ala Leu Leu Lys Val Tyr Ser Cys Asp Arg Asp lie Gly Asp 
725 730 735 



Tyr Tyr "Asp. Asn Thr Tyr Glu Asp lie Pro Gly. Phe Leu Leu Ser Gly 
740 745 750 

Lys Asn- Val lie Glu Pro Arg Ser Phe Ala Gin Asn Ser Arg Pro Pro 
755 760 : 765 

Ser Ala. Ser Gin Lys Gin Phe Gin Thr lie Thr Ser Pro Glu Asp Asp 
770 775 780 

Val Glu Leu Asp Pro Gin Ser Gly. Glu Arg Thr Gin Ala Leu Glu Glu 
785 790 795 800 

Leu Ser Val Pro Ser Gly Asp. Gly Ser; Met Lexi Leu Gly Gin Asn Pro, 
, ; 805 810 >. 815 

Ala Pro His Gly Ser. Ser Ser Ser Asp Leu ," Gin ■ Glu Ala Arg Asn Glu 
- 820. .: 825 . 830 

Ala Asp Asp Tyr Leu Pro Gly Ala Arg. Glu Arg Asn Thr Ala Pro Ser 
835 . 840 845 
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Ala Ala Ala Arg Leu Arg Pro Glu Leu His His Ser Ala Glu Arg Val 
850 855 860 

Leu Thr Pro Glu Pro Glu Lys Glu Leu Lys Lys Leu Asp Ser Lys Met 
865 870 875 880 

Ser Ser Ser Ser Asp Leu Leu Lys Thr Ser Pro Thr lie Pro Ser Asp 
885 890 895 

Thr Leu Ser Ala Glu Thr Glu Arg Thr His Ser Leu Gly Pro Pro His 
900 905 910 

Pro Gin Val Asn Phe Arg Ser Gin Leu Gly Ala lie Val Leu Gly Lys 
915 920 925 

Asn Ser Ser His Phe lie Gly Ala Gly Val Pro Leu Gly Ser Thr Glu 
930 935 940 

Glu Asp His Glu Ser Ser Leu Gly Glu Asn Val Ser Pro Val Glu Ser 
945 950 955 960 

Asp Gly lie Phe Glu Lys Glu Arg Ala His Gly Pro Ala Ser Leu Thr 
965 970 975 

Lys Asp Asp Val Leu Phe Lys Val Asn lie Ser Leu Val Lys Thr Asn 
980 985 990 

Lys Ala Arg Val Tyr Leu Lys Thr Asn Arg Lys lie His He Asp Asp 
995 1000 1005 

Ala Ala Leu Leu Thr Glu Asn Arg Ala Ser Ala Thr Phe Met Asp Lys 
1010 1015 1020 

— Asn Thr Thr Ala Ser Gly Leu Asn His Val Ser Asn Trp He Lys Gly 
1025 1030 1035 1040 

Pro Leu Gly Lys Asn Pro Leu Ser Ser Glu Arg Gly Pro Ser Pro Glu 
1045 1050 1055 

Leu Leu Thr Ser Ser Gly Ser Gly Lys Ser Val Lys Gly Gin Ser Ser 
1060 1065 1070 

Gly Gin Gly Arg - He Arg Val Ala Val Glu Glu Glu Glu Leu Ser Lys 
1075 1080 1085 

Gly Lys Glu Met Met Leu Pro Asn Ser Glu Leu Thr Phe Leu Thr Asn 
1090 1095 1100 

Ser Ala Asp-Val Gin . Gly Asn Asp Thr His Ser Gin Gly Lys -Lys Ser 
1105 1110 1115 1120 

Arg Glu Glu Met Glu Arg Arg Glu Lys Leu Val Gin Glu Lys Val Asp 
1125 1130 1135 

Leu Pro Gin Val Tyr Thr Ala Thr Gly Thr Lys Asn Phe Leu Arg Asn 
1140 1145 1150 
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lie Phe His Gin Ser Thr .Glu Pro Ser Val Glu Gly Phe Asp.Gly Gly 
1155 1160 H65 

Ser His Ala Pro Val Pro Gin Asp Ser Arg Ser Leu Asn Asp Ser Ala 
1170 1175 1180 

Glu Arg Ala Glu Thr His He Ala His Phe Ser Ala He Arg : Glu Glu 
1185 1190 H95 . 1200 

Ala Pro Leu- Glu Ala Pro Gly Asn Arg Thr Gly Pro Gly Pro Arg Ser 
1205 1210 1215 

Ala Val Pro Arg Arg Val Lys Gin Ser Leu Lys Gin. He Arg Leu Pro 
1220 1225 1230 

Leu Glu Glu He Lys Pro Glu Arg Gly Val Val Leu Asn Ala Thr Ser 
1235 1240 1245 

Thr Arg Trp Ser Glu Ser Ser Pro He Leu Gin Gly Ala Lys Arg Asn 
1250 1255 1260 

Asn Leu Ser Leu Pro Phe Leu Thr Leu Glu Met Ala Gly Gly Gin Gly 
1265 1270 1275 1280 

Lys He Ser Ala Leu Gly Lys Ser Ala Ala Gly Pro Leu Ala Ser Gly 
1285 1290 1295 

Lys Leu Glu Lys Ala Val Leu Ser Ser Ala Gly Leu Ser Glu Ala Ser 
1300 1305 1310 

r 

Gly Lys Ala Glu Phe Leu Pro Lys Val Arg Val. His Arg Glu Asp Leu 
1315 1320 1325 

Leu Pro Gin Lys Thr Ser Asn Val Ser Cys Ala His , Gly Asp Leu Gly 
1330 1335 1340 

Gin Glu. lie ..Phe Leu Gin Lys Thr. Arg Gly Pro Val Asn Leu Asn Lys 
1345 1350 1355 1360 

Val Asn Arg Pro Gly Arg . Thr Pro Ser Lys Leu Leu Gly Pro Pro Met 
1365 1370 1375 

Pro Lys Glu Trp Glu Ser Leu Glu Lys Ser Pro Lys Ser Thr Ala Leu 
1380 ■ 1385 1390 

Arg Thr Lys Asp lie He Ser Leu Pro Leu Asp Arg His Glu Ser Asn - 
1395 1400 1405 

His Ser lie Ala Ala ; Lys -Asn Glu Gly Gin Ala Glu Thr. Gin Arg' Glu 
1410 1415 .. 1420 

Ala Ala Trp Thr Lys Gin Gly Gly Pro - Gly Arg Leu Cys Ala Pro Lys 
1425 1430 1435 1440 

Pro Pro Val Leu Arg .Arg His Gin Arg Asp He Ser Leu Pro Thr, Phe 
1445 1450 1455 
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Gin Pro Glu Glu Asp Lys Met Asp Tyr Asp Asp lie Phe Ser Thr Glu 
1460 1465 1470 

Thr Lys Gly Glu Asp Phe Asp lie Tyr Gly Glu Asp Glu Asn Gin Asp 
1475 1480 1485 

Pro Arg. Ser Phe Gin Lys Arg Thr Arg His Tyr Phe He Ala Ala Val 
1490 1495 1500 

Glu Gin Leu Trp Asp Tyr Gly Met Ser Glu Ser Pro Arg Ala Leu Arg 
1505 1510 1515 1520 

Asn Arg Ala Gin Asn Gly Glu Val Pro' Arg Phe Lys Lys Val Val Phe 
1525 1530 1535 

Arg Giu Phe Ala Asp Gly Ser Phe Thr Gin Pro Ser Tyr Arg Gly Glu 
1540 1545 1550 

Leu Asn Lys His Leu Gly Leu Leu Gly Pro Tyr He Arg Ala Glu Val 
1555 1560 1565 

Glu Asp Asn He Met Val Thr Phe Lys Asn Gin Ala Ser Arg Pro Tyr 
1570 1575 1580 

Ser Phe Tyr Ser Ser Leu He Ser Tyr Pro Asp Asp Gin Glu Gin Gly 
1585 1590 1595 1600 

Ala Glu Pro Arg His Asn Phe Val Gin Pro Asn Glu Thr Arg Thr Tyr 
1605 1610 1615 

Phe Trp Lys Val Gin His His Met Ala Pro Thr Glu Asp Glu Phe Asp 
1620 1625 1630 

Cys Lys Ala Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val . 
1635 1640 1645 

His Ser Gly Leu He Gly Pro Leu Leu He Cys Arg Ala Asn Thr Leu 
1650 1655 1660 

Asn Ala Ala His Gly Arg Gin Val Thr Val Gin Glu Phe Ala Leu Phe 
1665 1670 1675 1680 

Phe Thr He Phe Asp Glu Thr Lys Ser Trp Tyr. Phe Thr Glu Asn Val 
1685 1690 1695 

Glu Arg Asn Cys Arg Ala Pro Cys His Leu Gin Met Glu Asp Pro Thr 
1700 1705 1710 

Leu Lys Glu Asn Tyr Arg Phe His Ala He Asn Gly Tyr Val Met Asp 
1715 1720 1725 

Thr Leu Pro Gly Leu Val Met Ala Gin Asn Gin Arg He Arg Trp Tyr 
1730 1735 1740 

Leu Leu Ser Met Gly Ser Asn Glu Asn He His Ser He His Phe Ser 
1745 1750 1755 1760 
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Gly His Val Phe. Ser, Val Arg Lys Lys Glu' Glu Tyr Lys Met Ala Val 
1765 1770 1775 

Tyr Asn Leu Tyr Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser 
,1780 1785 1790 

Lys Val Gly lie Trp Arg lie Glu Cys Leu He Gly Glu His Leu Gin 
1795 1800 1805 

Ala Gly Met Ser Thr ..Thr Phe Leu Val Tyr Ser Lys Glu . Cys Gin Ala 
1810 1815 1820 

Pro Leu Gly Met Ala Ser Gly Arg He Arg Asp Phe Gin He Thr Ala 
1825 1830 1835 1840 

Ser Gly Gin Tyr Gly Gin Trp Ala Pro Lys Leu Ala Arg Leu His Tyr 
1845 1850 1855 

Ser Gly Ser He Asn Ala Trp Ser Thr Lys Asp Pro His Ser Trp He 
I860 1865 1870. 

Lys Val Asp Leu Leu Ala Pro Met lie He His Gly He Met Thr Gin 
1875 1880 1885 

Gly Ala Arg Gin Lys Phe Ser Ser Leu Tyr He Ser Gin Phe He He 
1890 1895 1900 

Met Tyr Ser Leu Asp Gly Arg Asn Trp Gin Ser Tyr Arg Gly Asn Ser 
1905 1910 1915 1920 

Thr Gly Thr Leu Met Val Phe Phe Gly Asn Val Asp Ala Ser Gly He 
1925 1930 1935 

Lys His Asn lie Phe Asn Pro Pro lie Val Ala Arg Tyr He Arg, Leu 
1940 1945 1950 

His Pro Thr His Tyr. Ser He Arg Ser Thr Leu Arg Met Glu Leu Met 
1955 1960 1965 

Gly Cys . Asp Leu Asn Ser Cys Ser Met Pro Leu Gly Met Gin Asn Lys 
1970 1975 1980 

Ala He Ser Asp Ser Gin He Thr Ala Ser Ser His Leu Ser Asn He 
1985 1990 1995 2000 

Phe Ala Thr Trp Ser Pro Ser. Gin Ala Arg Leu His Leu Gin Gly Arg 
2005 2010 2015 



Thr Asn Ala Trp Arg Pro Arg Val Ser Ser Ala Glu Glu Trp Leu Gin 
2020 . 2025 . . .. . 2030 

Val Asp Leu Gin Lys Thr Val Lys Val Thr Gly He Thr Thr Gin Gly 
2035 2040 2045 

Val Lys - Ser Leu Leu Ser Ser Met Tyr Val Lys Glu Phe Leu Val Ser 
2050 2055 2060 
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Ser Ser Gin Asp Gly Arg Arg Trp Thr Leu Phe Leu Gin Asp Gly His 
2065 . 2070 2075 2080 

Thr Lys Val Phe Gin Gly Asn Gin Asp Ser Ser Thr Pro Val Val Asn 
2085 2090 2095 

Ala Leu Asp Pro Pro Leu Phe Thr Arg Tyr Leu Arg lie His Pro Thr 
2100 2105 2110 

Ser Trp Ala Gin His lie Ala Leu Arg Leu Glu Val Leu Gly Cys Glu 
2115 2120 2125 

Ala Gin Asp Leu Tyr * 
2130 

(2) INFORMATION FOR SEQ ID NO: 38: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

<ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Factor VIII lacking B domain 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3.. 4334 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

GA ATG CAG CTA GAG CTC TCC ACC TGT GTC TTT CTG TGT CTC TTG CCA 47 
Met Gin Leu Glu Leu Ser Thr Cys Val Phe Leu Cys Leu Leu Pro 
15 10 15 

CTC GGC TTT AGT GCC ATC AGG AGA TAC TAC CTG GGC GCA GTG GAA CTG 95 
Leu Gly Phe Ser Ala lie Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu 
20 25 30 

TCC TGG GAC TAC CGG CAA AGT GAA CTC CTC CGT GAG CTG CAC GTG GAC 143 
Ser Trp Asp Tyr Arg Gin Ser Glu Leu Leu Arg Glu Leu His Val Asp 
35 40 45 

ACC AGA TTT OCT GCT ACA GCG CCA GGA GCT CTT CCG TTG GGC CCG TCA 191 
Thr Arg Phe Pro Ala Thr Ala Pro Gly Ala Leu Pro Leu Gly Pro Ser 
50 55 60 

GTC CTG TAC AAA AAG ACT GTG TTC GTA GAG TTC ACG GAT CAA CTT TTC 23 9 

Val Leu Tyr Lys Lys Thr Val Phe Val Glu Phe Thr Asp Gin Leu Phe 
65 70 75 
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AGC GTT GCC. AGG CCC AGG CCA CCA TGG ATG .GGT. CTG CTG GGT CCT ACC 2 87 

Ser Val Ala Arg Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr 
80 85 90 95 

ATC CAG GCT GAG GTT TAC GAC ACG GTG ' GTC GTT ACC CTG AAG AAC ATG 3 35 

lie Gin Ala Glu Val Tyr Asp Thr Val Val Val Thr Leu Lys Asn Met 
100 . . 105 HO 

GCT TCT CAT CCC GTT AGT CTT CAC GCT GTC GGC GTC TCC TTC TGG AAA 383 
Ala Ser His Pro Val Ser Leu His Ala Val Gly Val Ser Phe Trp Lys 
115 120 125 

TCT Tcc GAA GGC, GCT GAA TAT GAG GAT CAC ACC AGC ; CAA AGG GAG AAG . 431 
Ser Ser Glu Gly Ala Glu Tyr Glu Asp His Thr Ser Gin Arg Glu Lys 
130 135 140 

GAA GAC GAT AAA GTC CTT CCC GGT AAA AGC CAA ACC TAC GTC TGG CAG 4 79 

Glu Asp Asp Lys Val Leu Pro Gly Lys Ser Gin Thr Tyr Val Trp Gin 
145 150 155 

GTC CTG AAA GAA AAT GGT CCA ACA GCC TCT GAC CCA CCA TGT CTC ACC 527 
Val Leu Lys Glu Asn Gly Pro Thr Ala Ser Asp Pro Pro Cys Leu Thr 
160 165 ivo 175 

TAC TCA TAC CTG TCT CAC GTG GAC CTG GTG AAA GAC CTG AAT TCG ' GGC 575 
Tyr Ser Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly 
180 185 190 

CTC ATT GGA GCC CTG CTG GTT TGT AGA GAA GGG AGT CTG ACC AGA GAA 623 
Leu lie Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Thr Arg Glu 
195 200 205 

AGG ACC CAG AAC CTG CAC GAA TTT GTA CTA CTT TTT GCT GTC TTT GAT 671 
Arg. Thr Gin Asn Leu His Glu Phe Val Leu Leu Phe. Ala Val . Phe Asp 
210 215 220 

GAA GGG - AAA AGT TGG CAC TCA GCA AGA AAT GAC TCC TGG ACA CGG GCC 719 
Glu Gly Lys Ser Trp His Ser Ala Arg Asn Asp Ser Trp Thr Arg Ala 
225 230 235 

ATG GAT CCC GCA CCT GCC AGG GCC CAG CCT GCA ATG - CAC ACA GTC . AAT 767 
Met Asp .-Pro Ala Pro . Ala Arg Ala Gin Pro Ala Met His Thr Val Asn 
240 245 250 255 

GGC TAT GTC AAC AGG TCT CTG CCA GGT CTG ATC GGA TGT CAT AAG AAA 815 
Gly Tyr Val Asn Arg Ser Leu Pro Gly Leu lie Gly Cys His Lys Lys 
260 265 270 

TCA GTC TAC TGG CAC GTG ATT GGA ATG GGC ACC AGC ~ CCG • GAA- GTG CAC/ 863 
Ser Val Tyr Trp His Val lie Gly Met' Gly Thr Ser Pro Glu Val His ; 
275 280 285 

TCC ATT TTT CTT GAA GGC CAC ACG TTT' CTC GTG AGG CAC CAT CGG CAG - - - 911 
Ser He Phe Leu Glu Gly His Thr Phe Leu Val. Arg His His Arg . Gin 
290 295 300 
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GCT TCC TTG GAG ATC TCG CCA CTA ACT TTC CTC ACT GCT CAG ACA TTC 959 
Ala Ser Leu Glu lie Ser Pro Leu Thr Phe Leu Thr Ala Gin Thr Phe 
305 310 315 

CTG ATG GAC CTT GGC CAG TTC CTA CTG TTT TGT CAT ATC TCT TCC CAC 1007 
Leu Met Asp Leu Gly Gin Phe Leu Leu Phe Cys His lie Ser Ser His 
320 325 330 335 

CAC CAT GGT GGC ATG GAG GCT CAC GTC AGA GTA GAA AGC TGC GCC GAG 1055 
His His Gly Gly Met Glu Ala His Val Arg Val Glu Ser Cys Ala Glu 
340 345 350 

GAG CCC CAG CTG CGG AGG AAA GCT GAT GAA GAG GAA GAT TAT GAT GAC 1103 
Glu Pro Gin Leu Arg Arg Lys Ala Asp Glu Glu Glu Asp Tyr Asp Asp 
355 360 365 

AAT TTG TAC GAC TCG GAC ATG GAC GTG GTC CGG CTC GAT GGT GAC GAC 1151 
Asn Leu Tyr Asp Ser Asp Met Asp Val Val Arg Leu Asp Gly Asp Asp 
370 375 380 

GTG TCT CCC TTT ATC CAA ATC CGC TCG GTT GCC AAG AAG CAT CCC AAA 1199 
Val Ser. Pro Phe He Gin He Arg Ser Val Ala Lys Lys His Pro Lys 
385 390 395 

ACC TGG GTG CAC TAC ATC TCT GCA GAG GAG GAG GAC TGG GAC TAC GCC 124 7 

Thr Trp Val His Tyr He Ser Ala Glu Glu Glu Asp Trp Asp Tyr Ala 
400 405 410 415 

CCC GCG GTC CCC AGC CCC AGT GAC AGA AGT TAT AAA AGT CTC TAC TTG 1295 
Pro Ala Val Pro Ser Pro Ser Asp Arg Ser Tyr Lys Ser Leu Tyr Leu 
420 425 430 

AAC AGT GGT CCT CAG CGA ATT GGT AGG AAA TAC AAA AAA GCT CGA TTC 134 3 

Asn Ser Gly Pro Gin Arg lie Gly Arg Lys Tyr Lys Lys Ala Arg Phe 
435 440 445 

GTC GCT TAC ACG GAT GTA ACA TTT AAG ACT CGT AAA GCT ATT CCG TAT 1391 
Val Ala Tyr Thr Asp Val Thr Phe Lys Thr Arg Lys Ala He Pro Tyr 
450 455 460 

GAA TCA GGA ATC CTG GGA CCT TTA CTT TAT GGA GAA GTT GGA GAC ACA 143 9 

Glu Ser Gly He Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr 
465 470 475 

CTT TTG ATT ATA TTT AAG AAT AAA GCG AGC CGA CCA TAT AAC ATC TAC 1487 
Leu Leu He He Phe Lys - Asn Lys Ala Ser Arg Pro Tyr Asn He Tyr 
480 485 490 495 

CCT CAT GGA ATC ACT GAT GTC AGC GCT TTG CAC CCA GGG AGA CTT CTA 153 5 

Pro His Gly He Thr Asp Val Ser Ala Leu His Pro Gly Arg Leu Leu 
500 505 510 

AAA GGT TGG AAA CAT TTG AAA GAC ATG CCA ATT CTG CCA GGA GAG ACT 1583 
Lys Gly Trp Lys His Leu Lys Asp Met Pro He Leu Pro Gly Glu Thr 
515 520 525 
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TTC AAG TAT AAA TGG ACA GTG ACT GTG G AA 1 GAT GGG CCA ACC AAG TCC 1631 
Phe Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser 
530 535 540 

GAT CCT CGG TGC CTG ACC CGC TAC TAC TCG AGC TCC ATT AAT CPA GAG 16 79 

Asp Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Ser He Asn Leu Glu 
545 550 555 

.AAA GAT CTG GCT TCG GGA CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA 172 7 

Lys Asp Leu Ala Ser Gly Leu He Gly, Pro Leu Leu He Cys Tyr Lys 
560 565 570 5 75 

GAA TCT GTA GAC CAA AGA GGA AAC CAG ATG ATG TCA GAC AAG AGA AAC 1775 
Glu Ser Val Asp Gin Arg Gly Asn Gin Met Met Ser Asp Lys Axg Asn 
580 585 590 

GTC ATC CTG TTT TCT GTA TTC GAT GAG AAT CAA AGC TGG TAC CTC GCA 1823 
Val He Leu Phe Ser Val Phe Asp Glu Asn Gin Ser Trp Tyr Leu Ala 
595 600 605 

GAG AAT ATT CAG CGC TTC CTC CCC AAT CCG GAT GGA TTA CAG CCC CAG 18 71 

Glu Asn He Gin Arg Phe Leu Pro Asn Pro Asp Gly Leu Gin Pro Gin 
610 615 620 

GAT CCA GAG TTC CAA GCT TCT AAC ATC ATG CAC AGC ATC AAT GGC TAT 1919 

Asp Pro Glu Phe Gin Ala Ser Asn He Met His Ser He Asn Gly Tyr 

625 630 635 

r 

GTT TTT GAT AGC TTG CAG CTG TCG GTT *. TGT TTG. CAC . GAG GTG GCA TAC 196 7 

Val Phe Asp Ser Leu Gin Leu Ser Val Cys Leu His Glu Val Ala Tyr ... 
640 645 650 655 

TGG TAC ATT CTA AGT, GTT GGA GCA CAG ACG GAC TTC CTC TCC GTC TTC 2 015 

Trp Tyr lie Leu Ser Val Gly Ala Gin Thr Asp Phe Leu Ser Val Phe 
660 665 670 

TTC TCT: GGC TAC ACC TTC AAA CAC AAA ATG GTC TAT GAA GAC ACA CTC * 2063 
Phe Ser Gly Tyr Thr Phe Lys His Lys Met Val ;Tyr Glu Asp Thr Leu . 
675 680 685 

ACC CTG TTC CCC ,TTC TCA GGA GAA ACG GTC TTC ATG TCA ATG GAA AAC 2111 
Thr Leu. Phe Pro 'Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn 
690 695 700 

CCA GGT CTC TGG GTC CTA GGG TGC CAC AAC TCA GAC TTG CGG AAC AGA 2159 
Pro Gly Leu Trp Val Leu Gly. Cys His Asn. Ser Asp Leu Arg Asn Arg 
705 710 715 

GGG ATG ACA GCC TTA CTG "AAG GTG TAT AGT TGT GAC AGG GAC ATT GGT - 22 07 
Gly Met , Thr Ala Leu Leu Lys , Val Tyr Ser Cys Asp Arg Asp lie . Gly 
720 725 730 735 

GAT TAT TAT GAC AAC ACT TAT GAA GAT ATT -CCA GGC TTC TTG CTG AGT 22 55 

Asp Tyr. Tyr Asp. Asn Thr Tyr Glu Asp He Pro Gly Phe Leu Leu Ser 
740 745 750 
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GGA AAG AAT GTC ATT GAA CCC AGA GAC ATA AGC CTT CCT ACT TTT GAG 2 303 

Gly Lys Asn Val lie Glu Pro Arg Asp lie Ser Leu Pro Thr Phe Gin 
755 760 765 



CCG GAG GAA GAC AAA ATG GAC TAT GAT GAT ATC TTC TCA ACT GAA ACG 2 3 51 

Pro Glu Glu Asp Lys Met Asp Tyr Asp Asp lie Phe Ser Thr Glu Thr 
770 775 780 



AAG GGA GAA GAT TTT GAC ATT TAC GGT GAG GAT GAA AAT CAG GAC CCT 2 399 

Lys Gly Glu Asp Phe Asp lie Tyr Gly Glu Asp Glu Asn Gin Asp Pro 
785 790 795 

CGC AGC TTT CAG AAG AGA ACC CGA CAC TAT TTC ATT GCT GCG GTG GAG 244 7 

Arg Ser Phe Gin Lys Arg Thr Arg His Tyr Phe lie Ala Ala Val Glu 
800 805 810 815 

CAG CTC TGG GAT TAC GGG ATG AGC GAA TCC CCC CGG GCG CTA AGA AAC " 24 95 

Gin Leu Trp Asp Tyr Gly Met Ser Glu Ser Pro Arg Ala Leu Arg Asn 
820 825 830 



AGG GCT CAG AAC GGA GAG GTG CCT CGG TTC AAG AAG GTG GTC TTC CGG 2 54 3 

Arg Ala Gin Asn Gly Glu Val Pro Arg Phe Lys Lys Val Val- Phe Arg 
835 840 845 



GAA TTT GCT GAC GGC TCC TTC ACG CAG CCG TCG TAC CGC GGG GAA CTC 2 5 91 

Glu Phe Ala Asp Gly Ser Phe Thr Gin Pro Ser Tyr Arg Gly Glu Leu 
850 855 860 

AAC -AAA CAC TTG GGG CTC TTG GGA CCC TAC ATC AGA GCG GAA GTT GAA 263 9 

Asn Lys His Leu Gly Leu Leu Gly Pro Tyr lie Arg Ala Glu Val Glu 
865 870 875 

GAC AAC ATC ATG GTA ACT TTC AAA AAC CAG GCG TCT CGT CCC TAT TCC 2687 
Asp Asn lie Met Val Thr Phe Lys Asn Gin Ala Ser Arg Pro Tyr Ser 
880 885 890 895 

TTC TAC TCG AGC CTT ATT TCT TAT CCG GAT GAT CAG GAG CAA GGG GGA 2*73 5 

Phe Tyr Ser Ser Leu lie Ser Tyr Pro Asp Asp Gin Glu Gin Gly Ala 
900 905 - 910 

GAA CCT CGA CAC AAC TTC GTC CAG CCA AAT GAA ACC AGA ACT TAC TTT 27 83 

Glu Pro Arg His Asn Phe Val Gin Pro Asn Glu Thr Arg Thr Tyr Phe 
915 920 925 

TGG AAA GTG CAG CAT CAC ATG GCA CCC ACA GAA GAC GAG TTT GAC TGC 2831 
Trp Lys Val Gin His His Met Ala Pro Thr Glu Asp Glu Phe Asp Cys 
930 935 940 

AAA GCC'TGG GCC TAC TTT TCT GAT GTT GAC CTG GAA AAA GAT GTG CAC 28 79 

Lys Ala Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His 
945 950 955 

TCA GGC TTG ATC GGC CCC CTT CTG ATC TGC CGC GCC AAC ACC CTG AAC 2 92 7 

Ser Gly Leu lie Gly Pro Leu Leu lie Cys Arg Ala Asn Thr Leu Asn 
960 965 970 975 
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GCT GCT CAC GGT AGA CAA GTG ACC GTG CAA GAA TTT GCT CTG TTT TTC 2975 
Ala Ala His Gly Arg Gin Val Thr Val Gin Glu Phe Ala Leu Phe Phe 
980 985 990 



ACT ATT TTT GAT GAG ACA AAG AGC TGG TAG TTC ACT GAA AAT GTG GAA 
Thr He Phe Asp Glu Thr- Lys Ser Trp Tyr Phe Thr Glu Asn Val Glu 
995 1000 1005 



3023 



AGG AAC TGC CGG GCC CCC TGC CAC CTG CAG ATG GAG GAC CCC ACT CTG 3 071 

Arg Asn Cys Arg Ala Pro Cys His Leu Gin Met Glu Asp Pro Thr Leu 
1010 1015 1020 

AAA GAA AAC TAT CGC TTC CAT GCA ATC AAT GGC TAT GTG ATG GAT ACA 3119 
Lys Glu Asn Tyr Arg Phe His Ala He Asn Gly Tyr Val Met Asp Thr 
1025 1030 1035 

CTC CCT GGC TTA GTA ATG GCT CAG AAT CAA AGG ATC CGA TGG TAT CTG 3167 
Leu Pro Gly Leu Val Met Ala Gin Asn Gin Arg He Arg Trp Tyr Leu 
1040 1045 1050 1055 

CTC AGC ATG GGC AGC AAT GAA AAT ATC CAT TCG ATT CAT TTT AGC GGA 3215 
Leu Ser Met Gly. Ser Asn Glu Asn He His Ser He His Phe Ser Gly 
1060 1065 1070 

CAC GTG TTC AGT GTA CGG AAA AAG GAG GAG TAT AAA ATG GCC GTG TAC 3 263 

His Val Phe Ser Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Val Tyr 
1075 1080 1085 

AAT CTC TAT CCG GGT GTC TTT GAG ACA GTG GAA ATG CTA CCG TCC AAA 3311 
Asn Leu Tyr Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys 
1090 X095 1100 

. GTT GGA ATT TGG CGA ATA GAA TGC CTG ATT GGC GAG CAC CTG CAA GCT 33 5 9 

Val Gly He Trp Arg He Glu Cys Leu He Gly Glu His Leu Gin Ala . 
1105 1110 1115 

GGG ATG AGC . ACG ACT TTC CTG GTG TAC AGC AAG GAG TGT CAG GCT CCA 340 7 

Gly Met Ser Thr Thr Phe Leu Val Tyr Ser Lys Glu Cys Gin Ala Pro 
H20 H25 1130 1135 

CTG GGA ATG GCT TCT GGA CGC ATT AGA GAT TTT CAG ATC ACA GCT TCA 345 5 

Leu Gly Met Ala Ser Gly Arg He Arg Asp Phe Gin He Thr Ala Ser 
1140 1145 1150 

GGA CAG TAT GGA CAG TGG GCC CCA AAG CTG GCC AGA CTT CAT TAT TCC 3503 
Gly Gin Tyr Gly Gin Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser 
1155. 1160 H65 

GGA TCA ATC . AAT GCC TGG AGC ACC AAG :GAT CCC CAC TCC. TGG ATC AAG 3551 
Gly Ser He Asn Ala Trp Ser Thr Lys Asp Pro His Ser- Trp He Lys 
1170 1175 1180 

GTG GAT CTG TTG GCA CCA ATG ATC ATT CAC GGC ATC ATG ACC CAG GGT- 3599 
Val Asp Leu Leu Ala Pro Met He He His Gly He Met Thr Gin Gly 
1185 1190 1195 
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GCC CGT CAG AAG TTT TCC AGC CTC TAC ATG TCC CAG TTT ATC ATC ATG 364 7 

Ala Arg Gin Lys Phe Ser Ser Leu Tyr He Ser Gin Phe He He Met 
1200 1205 1210 1215 

TAC AGT CTT GAC GGG AGG AAC TGG CAG AGT TAC CGA GGG AAT TCC ACG 36 95 

Tyr Ser Leu Asp Gly Arg Asn Trp Gin Ser Tyr Arg Gly Asn Ser Thr 
- 1220 1225 1230 

GGC ACC TTA ATG GTC TTC TTT GGC AAT GTG GAC GCA TCT GGG ATT AAA 3743 
Gly Thr Leu Met Val Phe Phe Gly Asn Val Asp Ala Ser Gly He Lys 
1235 1240 1245 

CAC AAT ATT TTT AAC CCT CCG ATT GTG GCT CGG TAC ATC CGT TTG CAC 3 791 

His Asn He Phe Asn Pro Pro He Val Ala Arg Tyr He Arg Leu His 
1250 1255 1260 

CCA ACA CAT TAC AGC ATC CGC AGC ACT CTT CGC ATG GAG TTG ATG GGC 383 9 

Pro Thr His Tyr Ser He Arg Ser Thr Leu Arg Met Glu Leu Met Gly 
1265 1270 1275 

TGT GAT TTA AAC AGT TGC AGC ATG CCC CTG GGA ATG CAG AAT AAA GCG 388 7 

Cys Asp Leu Asn Ser Cys Ser Met Pro Leu Gly Met Gin Asn Lys Ala 
1280 1285 1290 1295 

ATA TCA GAC TCA CAG ATC ACG GCC TCC TCC CAC CTA AGC AAT ATA TTT 3 935 

He Ser Asp Ser Gin He Thr Ala Ser Ser His Leu Ser Asn He Phe 
1300 1305 * 1310 

GCC ACC TGG TCT CCT TCA CAA GCC CGA CTT CAC CTC CAG GGG CGG ACG 3 983 

Ala Thr Trp Ser Pro Ser Gin Ala Arg Leu His Leu Gin Gly Arg Thr 
1315 1320 1325 

AAT GCC TGG CGA CCC CGG GTG AGC AGC GCA GAG GAG TGG CTG CAG GTG 4031 
Asn Ala Trp Arg Pro Arg Val Ser Ser Ala Glu Glu Trp Leu Gin Val 
1330 1335 1340 

GAC CTG CAG AAG ACG GTG AAG GTC ACA GGC ATC ACC ACC CAG GGC GTG 4 07 9 

Asp Leu Gin Lys Thr Val Lys Val Thr Gly lie Thr Thr Gin Gly Val 
1345 1350 1355 

AAG TCC CTG CTC AGC AGC ATG TAT GTG AAG GAG TTC CTC GTG TCC AGT 412 7 

Lys Ser Leu Leu Ser Ser Met Tyr Val Lys Glu Phe Leu Val Ser Ser 
1360 1365 1370 1375 

AGT CAG GAC GGC CGC CGC TGG ACC CTG TTT CTT CAG GAC GGC CAC ACG 4175 
Ser Gin Asp Gly Arg Arg Trp Thr Leu Phe Leu Gin Asp Gly His Thr 
1380 1385 1390 

AAG GTT TTT CAG GGC AAT CAG GAC TCC TCC ACC CCC GTG GTG AAC GCT 422 3 

Lys Val Phe Gin Gly Asn Gin Asp Ser Ser Thr Pro Val Val Asn Ala 
1395 1400 1405 

CTG GAC CCC CCG CTG TTC ACG CGC* TAC CTG AGG ATC CAC CCC ACG AGC 4271 
Leu Asp Pro Pro Leu Phe Thr Arg Tyr Leu Arg He His Pro Thr Ser 
1410 1415 1420 
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TGG GCG CAG CAC ATC GCC CTG AGG CTC GAG GTT CTA GGA TGT .GAG GCA 4319 
Trp Ala Gin His He Ala Leu Arg Leu Glu Val Leu Gly Cys Glu Ala 
1425 1430 1435 

CAG GAT CTC TAC TGA 4334 

Gin Asp Leu Tyr * 

1440 



(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1444 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Gin Leu Glu Leu Ser Thr Cys Val Phe Leu Cys Leu Leu Pro Leu 

1 5 . -10 , . 15 

Gly Phe Ser Ala He Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser 
20 25 30 

Trp Asp Tyr Arg 'Gin Ser Glu Leu Leu Arg Glu Leu His Val Asp Thr 
35 40 45 

Arg Phe Pro Ala Thr Ala Pro Gly Ala Leu Pro Leu Gly Pro Ser Val 
50 55 60 

Leu Tyr Lys Lys Thr Val Phe Val Glu Phe Thr Asp Gin Leu Phe Ser 
65 70 75 80 



Val Ala Arg Pro Arg 

85 

Gin Ala Glu Val Tyr 
100 

Ser His Pro Val Ser 
115 

Ser Glu Gly Ala Glu 
130 

Asp Asp Lys Val Leu 
145 - . 

Leu Lys Glu Asn Gly 
165 

Ser Tyr Leu Ser His 
180 



Pro Pro Trp Met Gly Leu 
90 

Asp Thr Val Val Val Thr 
105 

Leu His Ala Val Gly Val 
120 

Tyr Glu Asp His Thr Ser 
135 

Pro Gly Lys Ser Gin Thr 
150 155 

Pro Thr Ala Ser Asp Pro 
- 170 

Val Asp Leu Val Lys Asp 
185 



Leu Gly Pro Thr He 
95 

Leu Lys Asn Met Ala 
110 

Ser Phe Trp Lys Ser 
125 

Gin Arg Glu Lys Glu 
14 0 

Tyr Val Trp Gin Val 
160 

Pro Cys Leu Thr Tyr 
175 

Leu Asn Ser Gly Leu 
190 
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lie Gly Ala Leu 
195 

Thr Gin Asn Leu 
210 

Gly Lys Ser Trp 
225 

Asp Pro Ala Pro 



Tyr Val Asn Arg 
260 

Val Tyr Trp His 
275 

lie Phe Leu Glu 
290 

Ser Leu Glu lie 
305 

Met Asp Leu Gly 



His Gly Gly Met 
340 



Leu Val Cys Arg 
200 

His Glu Phe Val 
215 

His Ser /Via Arg 
230 

Ala Arg Ala Gin 
245 

Ser Leu Pro Gly 



Val lie Gly Met 
280 

Gly His Thr Phe 
295 

Ser Pro Leu Thr 
310 

Gin Phe Leu Leu 
325 

Glu Ala His Val 



Glu Gly Ser Leu 



Leu Leu Phe Ala 
220 

Asn Asp Ser Trp 
235 

Pro Ala Met His 
250 

Leu lie Gly Cys. 
265 

Gly Thr Ser Pro 



Leu Val Arg His 
300 

Phe Leu Thr Ala 
315 

Phe Cys His lie 
330 

Arg Val Glu Ser 
345 



Thr Arg Glu Arg 
205 

Val Phe Asp Glu 



Thr Arg Ala Met 
240 

Thr Val Asn Gly 
255 

His Lys Lys Ser 
270 

Glu Val His Ser 
285 

His Arg Gin Ala 



Gin Thr Phe Leu 
320 

Ser Ser His His 
335 

Cys Ala Glu Glu 
350 



Pro Gin Leu Arg 
355 

Leu Tyr Asp Ser 
370 

Ser Pro Phe lie 
385 

Trp Val His Tyr 



Ala Val Pro Ser 
420 

Ser Gly Pro Gin 
435 

Ala Tyr Thr Asp 
450 

Ser Gly lie Leu 
465 

Leu lie lie Phe 



Arg Lys Ala Asp 
360 

Asp Met Asp Val 
375 

Gin lie Arg Ser 
390 

He Ser Ala Glu 
405 

Pro Ser Asp Arg 



Arg He Gly Arg 
440 

Val Thr Phe Lys 
455 

Gly Pro Leu Leu 
470 

Lys Asn Lys Ala 
485 



Glu Glu Glu Asp 



Val Arg Leu Asp 
380 

Val Ala Lys Lys 
395 

Glu Glu Asp Trp 
410 

Ser Tyr Lys Ser 
425 

Lys Tyr Lys Lys 



Thr Arg Lys Ala 
460 

Tyr Gly Glu Val 
475 

Ser Arg Pro Tyr 
490 



Tyr Asp Asp Asn 
365 

Gly Asp Asp Val 



His Pro Lys Thr 
400 

Asp Tyr Ala Pro 
415 

Leu Tyr Leu Asn 
430 

Ala Arg Phe Val 
445 

lie Pro Tyr Glu 



Gly Asp Thr Leu 
480 

Asn He Tyr Pro 
495 
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His Gly lie Thr Asp Val Ser Ala Leu His Pro Gly Arg Leu Leu Lys 
. 500 505 510 

Gly Trp Lys His Leu Lys Asp Met Pro He. Leu Pro Gly Glu Thr Phe 
515 . r 520 525 

Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys- Ser Asp 
530 535 540 



Pro Arg Cys Leu Thr. Arg Tyr Tyr Ser Ser Ser He Asn Leu -Glu Lys 

545 550 555 560 

Asp Leu Ala Ser Gly Leu lie Gly Pro Leu Leu He Cys Tyr Lys Glu 

565 570 575 



Ser Val Asp Gin Arg Gly Asn Gin 
580 

He Leu Phe Ser Val Phe Asp Glu 
595 600 

Asn He Gin Arg Phe Leu Pro Asn 
610 615 

Pro Glu Phe Gin Ala Ser Asn He 
625 630 



Met Met Ser Asp Lys Arg Asn Val 
585 590 

Asn Gin Ser Trp Tyr Leu Ala Glu 
605 

Pro Asp Gly Leu Gin Pro Gin Asp 
620 

Met His Ser He Asn Gly Tyr Val 
635 640 



Phe Asp Ser Leu 



Tyr He Leu Ser 
660 

Ser Gly Tyr Thr 
675 

Leu Phe Pro Phe 
690 

Gly Leu Trp Val 
705 

Met Thr Ala Leu 



Tyr Tyr Asp Asn 
740 

Lys Asn Val lie 
755 : 

Glu Glu Asp Lys 
770 



Gin Leu Ser Val 
645 

Val Gly Ala Gin 



Phe Lys Hi s Lys 
680 

Ser Gly Glu Thr 
695 

Leu Gly Cys His 
710 

Leu Lys Val Tyr 
725 

Thr Tyr Glu Asp 



Glu Pro . Arg Asp 
760 

Met Asp Tyr Asp 
775 



Cys Leu His Glu 
650 

Thr Asp Phe Leu 
665 

Met Val Tyr Glu 



Val Phe Met Ser 
700 

Asn Ser Asp Leu 
715 

Ser Cys Asp Arg 
730 

He Pro Gly Phe 
745 

He: Ser Leu Pro 



Asp He Phe Ser 
780 



Val Ala Tyr Trp 
655 

Ser Val Phe Phe 
670 

Asp Thr Leu Thr 
685 

Met Glu Asn Pro 



Arg Asn Arg Gly 
720 

Asp He Gly Asp 
735 

Leu Leu Ser Gly . 
750 

Thr Phe Gin, Pro 
765 

Thr Glu Thr Lys 



Gly Glu Asp Phe Asp He Tyr Gly Glu Asp Glu Asn Gin Asp Pro Arg 
785 790 795 800 
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Ser Phe Gin Lys Arg Thr Arg His Tyr Phe lie Ala Ala Val Glu Gin 
805 810 815 

Leu Trp Asp Tyr Gly Met Ser Glu Ser Pro Arg Ala Leu Arg Asn Arg 
820 825 830 

Ala Gin Asn Gly Glu Val Pro Arg Phe Lys Lys Val Val Phe Arg Glu 
835 840 845 

Phe Ala Asp Gly Ser Phe Thr Gin Pro Ser Tyr Arg Gly Glu Leu Asn 
850 855 860 

Lys His Leu Gly Leu Leu Gly Pro Tyr" lie Arg Ala Glu Val Glu Asp 
865 870. 875 880 

Asn lie Met Val Thr Phe Lys Asn Gin Ala Ser Arg Pro Tyr Ser Phe 
885 890 895 

Tyr Ser Ser Leu lie Ser Tyr Pro Asp Asp Gin Glu Gin Gly Ala Glu 
900 905 910 

Pro Arg His Asn Phe Val Gin Pro Asn Glu Thr Arg Thr Tyr Phe Trp 
915 920 925 

Lys Val Gin His His Met Ala Pro Thr Glu Asp Glu Phe Asp Cys Lys 
930 935 940 

Ala Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser 
945 950 955 960 

Gly Leu lie Gly Pro Leu Leu lie Cys Arg Ala Asn Thr Leu Asn Ala 
965 970 975 

Ala His Gly Arg Gin Val Thr Val Gin Glu Phe Ala Leu Phe Phe Thr 
980 985 990 

lie Phe Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Val Glu Arg 
995 1000 1005 

Asn Cys Arg Ala Pro Cys His Leu Gin Met Glu Asp Pro Thr Leu Lys 
1010 1015 1020 

Glu Asn Tyr Arg Phe His Ala lie Asn Gly Tyr Val Met Asp Thr Leu 
1025 1030 1035 1040 

Pro Gly Leu Val Met Ala Gin Asn Gin Arg lie Arg Trp Tyr Leu Leu 
1045 1050 1055 

Ser Met Gly Ser Asn Glu Asn lie His Ser lie His Phe Ser Gly His 
1060 1065 1070 

Val Phe Ser Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Val Tyr Asn 
1075 1080 1085 

Leu Tyr Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Val 
1090 1095 1100 
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Gly He Trp Arg-lle Glu Cys Leu lie Gly Glu * His Leu Gin Ala Gly 
H05 mo 1115 1120 

Met Ser Thr Thr Phe Leu Val Tyr Ser Lys Glu Cys Gin Ala Pro Leu 
1125 1130 H35 

Gly Met Ala Ser Gly Arg He Arg Asp Phe Gin He Thr Ala Ser Gly 
1140 1145 1150 

Gin Tyr Gly Gin Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly 
1155 H60 1165 

Ser He Asn Ala Trp Ser Thr Lys Asp' Pro His Ser Trp lie Lys Val 
1170 1175 H80 

Asp Leu Leu Ala Pro Met He He His Gly He Met Thr Gin Gly Ala 
1185 1190 1195 1200 

Arg Gin Lys Phe Ser Ser Leu Tyr He Ser Gin Phe He He Met Tyr 
1205 1210 1215 

Ser Leu Asp Gly Arg Asn Trp Gin Ser Tyr Arg Gly Asn Ser Thr Gly 
1220 1225 1230 

Thr Leu Met Val Phe Phe Gly Asn Val Asp Ala Ser Gly lie Lys His 
1235 1240 1245 

Asn He Phe Asn Pro Pro He Val Ala Arg Tyr He Arg Leu His Pro 
1250 1255 1260 

Thr His Tyr Ser He Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys 
1265 1270 1275 1280 

Asp Leu Asn Ser Cys Ser Met Pro Leu Gly Met Gin Asn Lys Ala He 
1285 1290 1295 

Ser Asp Ser Gin He Thr Ala Ser Ser His Leu Ser Asn lie Phe Ala 
1300 1305 1310 

Thr Trp Ser Pro Ser Gin Ala Arg Leu His Leu Gin Gly Arg Thr Asn 
1315 1320 1325 

Ala Trp Arg Pro Arg Val Ser Ser Ala Glu Glu Trp Leu Gin Val Asp 
1330 1335 1340 

Leu Gin Lys Thr Val Lys Val Thr Gly He Thr Thr Gin Gly Val Lys 
1345 1350 1355 1360 

Ser Leu Leu Ser Ser Met Tyr Val Lys Glu Phe Leu Val Ser Ser Ser 
1365 1370 1375 

Gin Asp Gly Arg Arg Trp Thr Leu Phe Leu Gin Asp Gly His Thr Lys 
1380 1385 1390 

Val Phe Gin Gly Asn Gin Asp Ser Ser Thr Pro Val Val Asn Ala Leu 
1395 1400 1405 
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Asp Pro Pro Leu Phe Thr Arg Tyr Leu Arg lie His Pro Thr Ser Trp 
1410 . 1415 1420 

Ala Gin His lie Ala Leu Arg Leu Glu Val Leu Gly Cys Glu Ala Gin 
1425 1430 1435 1440 

Asp Leu Tyr * 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME / KEY : Peptide 

(B) LOCATION: 1..19 

<D) OTHER INFORMATION: /note= "Signal peptide of human 
Factor VIII." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

Met Gin lie Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe 
15 10 15 

Cys Phe Ser 
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